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FOREWORD 


The 1960 Western Joint Computer Conference met in San Francisco for the third 
time. Interest in these conferences has been increasing steadily along with the impres- 
sive growth of the computer industry in the San Francisco Bay area. Manufacturing rep- 
resentation in the form of exhibits was the greatest in the history of the Western Joint 
Computer Conference. 

In keeping with the theme of the 1960 Conference, "The Challenge of the Next 
Decade," the papers presented in the Proceedings were selected on the basis of trends 
in techniques and applications rather than descriptions of existing or about-to-be an- 
nounced equipment. The philosophy of this year's conference committee was that of 
"taking stock" and attempting to "look ahead” to determine which way this dynamic in- 
dustry is going. We have essentially reached a plateau where technology has given us 
a feel for the tremendous impact that this industry will have upon our everyday life. 

It seems desirable, occasionally, to pause and evaluate and, indeed, this in itself can 
justify the value of a conference. 

This publication contains the papers presented at the 1960 Western Joint Computer 
Conference, and was available at the time of the Conference. 


R. M. Bennett 
General Chairman 

1960 Western Joint Computer Conference 
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1.1 

THE HISTORICAL DEVELOPMENT, AND PREDICTED STATE-OF-THE-ART 
OF THE GENERAL-PURPOSE DIGITAL COMPUTER 

By Charles P. Bourne and Donald F. Ford 
Computer Techniques Laboratory 
Stanford Research Institute 
Menlo Park, California 


Summary 


Some of the important characteristics of all 
the general-purpose digital computers that have 
ever been built, or are in the process of being 
built, have been collected together in order to 
show the changes in performance and characteristics 
during the passage of time. These collected data, 
as well as information regarding recent development 
work, have been used to extrapolate the character- 
istics and performance figures into the 1960-1965 
era. The report considers such characteristics as 
add and multiply times, memory characteristics, 
pulse repetition rates, and internal system para- 
meters . 

The collected data seems to suggest that the 
majority of the computers developed between now and 
1965 will show very little change in performance 
from that which was obtained during the last five 
years. However, a few research machines will defi- 
nitely advance the technology, possibly as much as 
one order of magnitude for some of the characteris- 
tics. In appendix B there is a listing of approxi- 
mately 300 different computers, in an attempt to 
provide an initial directory of the world's 
computers . 

Introduction 

This paper presents the results of a study 
that was conducted to answer the question, "What 
can be learned from a historical study of the dev- 
elopment of general-purpose digital computers?" It 
was of interest to re-trace, in summary fashion, 
the development of the digital computer. It was 
also of interest to see if the development of any 
of the machine characteristics followed a pattern 
which would allow an extrapolation into the future. 
The study was restricted to the characteristics 
which describe the memory and central processor or 
arithmetic units, and did not consider input-output 
features . 

The characteristics were examined from the 
viewpoint of a machine user, and not a machine de- 
signer or component specialist. That is, an at- 
tempt was made to describe the operational perfor- 
mance which the computer system provides the user, 
instead of concentrating on the details of the 
manner in which the logic is accomplished. For 
example, a comparison of typical execution times 
was studied, instead of looking at the switching 
times for the individual circuits. In a few 
instances, a study was made of some of the more 
hardware-oriented characteristics such as the pulse 
repetition rate or "clock" rate, and the type of 
high-speed memory component used . 


The machine characteristics Y e £ e obtained 
from several summary publications ’ , as well as 
numerous journal articles and the literature of 
manufacturers. The data for approximately 180 of 
the machines was checked by letters of verifica- 
tion from the equipment designers or manufacturers . 

The machine characteristics which were exa- 
mined are listed below. The definitions for these 
characteristics are included in Appendix A. 

Number of addresses per instruction 

Number of index registers 

Number of decimal digits per instruction 

Number of decimal digits per data word 

Number of binary bits per instruction 

Number of binary bits per data word 

Internal clock rate 

Fixed point add time 

Fixed point multiply time 

Floating point add time 

Floating point multiply time 

Number of memory speed levels 

Types of high speed memory 

High speed memory size 

High speed memory access time 

Total internal memory size 

Summary Comparisons of Computers to D ate 

Each of the machine characteristics was 
plotted against a horizontal calendar scale in 
order to observe the changes and rates of change 
for the parameters. A single point was plotted 
for each different machine type to show the 
earliest date that the system performed with the 
described characteristics. Points were plotted 
for all the available data, however, in many cases, 
various parameters were not described in the lit- 
erature. For this reason, all the plots do not 
contain the same number of points. 

The actual growth curve to show the rate at 
which new computer models have been developed is 
shown in Fig. 1. Fig. 2 shows the cumulative 
number of different computers which have been 
developed. In addition to the computers which are 
shown on Fig. 1 and 2, approximately 65 machines 
were not plotted, since an accurate operational 
date could not be determined. It can be seen that 
new computer models are being developed in contin- 
ually increasing numbers, and that this trend will 
probably continue into the future. The increasing 
number of new computer models is due to several 
factors, but it would appear that the major reasons 
are those which are listed below: 
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1. Parallel efforts by commercial organiza- 
tions acting in the spirit of free enter- 
prise, with each organization competing 
for a share of the market with its own 
particular model. This is fostered by the 
increasing market for computers. 

2. Continued marketing pressures to improve 
the performance characteristics of a par- 
ticular computer model, or to produce more 
competitive equipment. 

3 . Efforts by universities and some industri- 
al concerns to build their own computers 
in order to develop a technical competence 
in this field, or to obtain an inexpensive 
computing facility. 

4. The emergence of a computer industry in 
areas which were late in starting in this 
field. For example, Japan built no digi- 
tal computers before 1953 ; since that time 
however, she has developed 29 different 
computers, and shows every indication of 
developing more. The European countries 
are also developing rapidly as computer 
manufacturers . 

Internal Characteristics 

There has been very little uniformity or in- 
dication of trends for such internal characteris- 
tics as the number of addresses per instruction 
and the number of characters per data word or in- 
struction. There have been some slight trends 
noticed in the number of index registers and the 
internal clock rate. 

1. Number of Addresses per Instruction - Figure 3 
shows that the single-address instruction is 
used more than any other type, although a 
large number of systems have used two-and- 
three-address systems. The four-address sys- 
tems are definitely a minority; however, 
there is one instance of a five-address sys- 
tem. Future computers will probably follow 
the same pattern, with predominantly single- 
address format. 

2 . Number of Index Registers - Figure 4 shows 
that there is a definite trend toward pro- 
viding index registers (or "B-boxes") in in- 
creasing numbers for each computer. The use 
of a single index register is first noted in 
1951. The first system with multiple index 
registers is noted in 1954. From that time 
on, systems became available with a greater 
number of registers, and in some cases this 
went as high as 64, 99, and 1024 index regis- 
ters per machine . A large percentage of the 
future computers will probably have at least 
one index register. Index registers have pro- 
ven to be effective for applications in both 
business and scientific computations, and will 
probably appear more frequently on both types 
of systems. 


3 . Decimal Digits per Instructio n - Figure 5 
shows the data for the non-binary machines, 

to indicate the number of characters (decimal, 
octal, or alphabetic) per instruction. There 
does not appear to be any significant degree 
of uniformity, although more systems use 10 
digits than any other number. Future compu- 
ters will probably show the same large vari- 
ety and lack of uniformity. 

4. Decimal Di gits per Data Word - Figure 6 shows 
a large spread of values running from 4 to 24 
digits. However, data words with 10, 11, or 
12 digits appear to be the predominant choice. 
In 1953 the concept of a variable-length data 
word was introduced, and several systems 
utilized this feature after that. Future 
computers will probably continue to show a 
variety of values, but may use the 10, 11, or 
12 digits more often than any other choice. 
There will probably be an increasing number 

of machines with variable-length data words. 

5. Binary Bits per Instruction - Figure 7 shows 
the binary bits per instruction ranging from 
4 to 68. There is a decided lack of unifor- 
mity, and no strong tendency toward a parti- 
cular value. Future computers will undoub- 
tedly follow the same large variety and lack 
of uniformity. 

6. Binary Bits per Data Word - Figure 8 shows 
the binary bits per data word ranging from 4 
to 72. However, the majority of the systems 
have ranged between 30 and 50 bits per data 
word. Future computers will probably follow 
the same large variety and lack of uniformity. 

7. Internal Clock Rate - Figure 9 shows the in- 
ternal clock rate or pulse repetition rate. 
This is not a very good measure of the speed 
or power of a particular computer. This is 
because the great variety of ways in which 
the logic can be implemented (serial, paral- 
lel, and various combinations of serial- 
parallel) can provide a large range of effec- 
tive operating speeds . There are several 
examples of machines with nearly equal arith- 
metic speeds in spite of the fact that one 

of the machines has a clock rate which is 
five times slower than the clock rate of the 
other machine. Conversely, it might be noted 
that all of the IBM-700 series machines from 
1953-1957 used a 1.0 Me clock, even though 
there were marked differences in execution 
times. The clock rate data was included here 
to give an indication of the speeds of opera- 
tion of the internal circuits. The fastest 
clock rate (10.0 Me) is currently credited to 
the IBM-STRETCH, or Los Alamos compu^e^, 
although hardware has been developed ’ which 
operates at speeds up to 500 Me . The data 
shows, among other things, that people like 
round numbers, as indicated by the large num- 
ber of 100 Kc and 1 Me systems . The clock 
rates ranged from 200 cycles per minute to 
10.0 Me. 



There are a moderate number of asynchronous 
systems (at least 28) which date back to the 
earliest machines . Because the asynchronous sys- 
tems offer an inherent speed advantage over the 
clocked systems, they will probably find increasing 
use as better design techniques become available. 
There will be an increasing number of systems which 
use clock rates of 1 Me or higher. There are al- 
ready at least 47 systems which operate with pulse 
rates of 1 Me or higher. These internal speeds can 
now be achieved fairly easily with current tran- 
sistor and diode circuitry. However, because of 
the variety of possible applications, there will 
continue to be a large number of new systems with 
moderate internal speeds (100 - 500 Kc) . 

Arithmetic Speeds 

As defined in the Appendix, the execution 
times for the add and multiply operations are the 
effective operating times which a programmer would 
use in estimating the running time of a particular 
program. It can be seen from the pertinent figures 
(Fig. 10, 11, 12, 13) that there continues to be a 
wide range of execution times for all computers. 
This wide range of machine speeds, and characteris- 
tics, does not solely reflect the state-of-the-art 
or its trends. This range of values more accur- 
ately reflects the policy decisions and compromises 
which were made by each computer builder in his 
attempt to aim for a particular part of the commer- 
cial market, or produce (at minimum cost) a tech- 
nical staff or computing facility. It is probably 
reasonable to state that not every builder tried to 
advance the state-of-the-art. Because of the fact 
that the collected data represented a heterogenous 
collection of intents and compromises, it was felt 
that it would be misleading to represent this data 
in some mathematical notation such as a yearly 
average or polynomial approximation. 

The computers which have served to extend the 
state-of-the-art were primarily research and devel- 
opment machines, and were not designed primarily 
for commercial exploitation. The distinction m 
performance between systems which were built for 
research and systems which were built for commer- 
cial exploitation is shown in Fig. 14. This figure 
shows the fixed point multiply times for the 
machines in these two categories, and indicates 
that the state-of-the-art improvements were fur- 
nished entirely by the research machines. However, 
the commercial machines usually caught up with the 
fastest research machines in a matter of approxi- 
mately five years. 

Many builders have claimed the title of 
"world's most powerful computer" for their parti- 
cular machine. It is impossible at this dite to 
define accurately and unambiguously "computer- 
po,ver". However, in an attempt to find out how 
long a machine could expect to retain this title, 
it was assumed that "computer-power" was propor- 
tional to the multiply time. With this assumption, 
it was a simple matter to determine the ten most 
powerful computers in each year from 1944 to the 
present. It was indeed an interesting observation 
to note that, on the average, a computer did not 
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remain in the "top-ten” for more than three years. 

For many reasons, there will be a continued 
effort to develop computers with faster execution 
speeds. However, it appears that faster speeds 
will not be achieved by extensions of the current 
hardware practices and techniques. It appears 
that new components and approaches must be devel-^ 
oped if these speeds are to be attained. Meagher 
sums this up very nicely in the following para- 
graph . 

"Let us first examine the need for 
new techniques which has resulted from 
our desire for higher speed. The 
existing circuits, with separate resis- 
tors, diodes, capacitors, and transis- 
tors, have a physical size which re- 
quires at least one cubic inch for a 
logical element. One such circuit has 
within itself a loop which constitutes 
an inductance with a shunt capacitance 
in the switching element . This LC cir- 
cuit exhibits resonance. If the loop 
is about one-half inch in diameter, the 
inductance would be about 0.06 micro- 
henry, and further, if the capacitance 
is 5 micro-micro-farad (both reasonable 
minimum values) , the resulting resonant 
frequency would be about 300 Me . 

Clearly it would be difficult to operate 
this circuit at an information frequency 
of more than about one-fifth the reson- 
ant frequency, or in other words, 60 Me. 

We are already close to this frequency 
in some present computer circuits. Thus, 
faster circuits require either smaller 
size for "lumped-constant" techniques, 
or, alternatively, "distributed-constant" 
techniques. Low temperature circuit 
elements offer one possibility for ex- 
tremely small size. Micro-wave tech- 
niques, the subject for the Symposium, 
offer the possibility of the distributed- 
constant approach." 

To date, all of the circuits for storage and 
logic in the operating computers have employed a 
frequency band for pulse rates which starts at or 
near zero and extends to some upper limit. The 
upper limit for this base-band type of system 
appears to be about 50 Me (see reference ) . It 
would appear that some new techniques would have 
to be devised to achieve information pulse rates 
of greater than 50 or 100 Me . 

Microwave circuits for storage and logjg 
functions have been demonstrated by Sterzer 
which operate at pulse rates of 100 Me and | 
carrier frequency of 2,000 Me; and by Ortel which 
operate with pulse rates of 500 Me and a carrier 
frequency of 11,000 Me. Ortel demonstrated a 
serial multiplier operating at a clock frequency 
of 160 Me which multiplied two 8-digit binary num- 
bers to form a 16-digit product in 1.6 micro- 
seconds. It was stated that it would be feasible 
to use the same microwave circuit with a clock 
frequency of 640 Me to obtain a multiplication 
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g 

Billing and Rudiger have stated that the use 
of the nonlinear capacitance of semiconductor 
diodes in parametron circuits also appear promising 
for high frequency operation. Experimental work 
has been conducted at frequencies up to 450 Me, and 
theoretical studies seem to indicate that frequen- 
cies in the neighborhood of 30 Kmc should be 
possible . 

Aside from economic problems, one of the 
major technical problems in achieving very high 
circuit speeds is that the time allocated to 
switch a signal through the circuit approaches the 
propagation time for any 7 electrical wave. This is 
stated very well by Leas in the following para- 
graphs . 

"To improve substantially upon the 
present computing speeds, manipulative 
elements for the basic functions of 
gating and storing binary signals are 
required which have extremely fast 
physical response, reckoned in nano- 
seconds (10 ) rather than microseconds 

(10 ) . Furthermore, because large 

numbers of such elements must be used 
in systems sufficiently comprehensive 
to make significant use of their speed, 
these elements must be physically small 
in order that the machine itself be 
small enough not to cause prohibitive 
delays due to the finite propogation 
velocity of electrical signals. 

"For example, during one nanosecond 
, electrical signals in free space travel 
one foot (30 cm) , and in most solid 
materials only 6 to 8 inches (15 to 20 
cm) . This means that two circuits must 
be only fractions of an inch apart if 
the delay between them is to be negli- 
gible. Furthermore, the whole computer 
size is limited since the performance of 
an instruction requires information to 
reach and be' returned from all parts of 
the computer. To obtain memory cycles 
of 10 mi Hi -microseconds , the computer 
can not be more than 18 to 24 inches 
(45 to 60 cm) in diameter." 

In addition to the problem of physical size, 
Leas also mentions the problem of power consump- 
tion of the individual logic elements. 

"This limitation in size also sets 
limitations on the power consumption of 
individual elements . For a computer of 
minimum size, at least 5000 logical ele- 
ments and 10,000 memory elements will be 
required, implying power consumptions 
below 500 mw for each logic element, and 
50 mw for each memory element, if normal 
forced air cooling is used. It is highly 
desirable that these figures be reduced 
to increase the capability of the computer. 
Satisfactory values would be 15,000 logic 


elgments, each taking 150 mw, and 2.5 x 
10 memory elements, each taking 2 mw." 

One additional drawback to improving the exe- 
cution times is the lack of a very high speed 
random access storage . Unless some "look-ahead" 
or sophisticated control techniques are used, the 
lack of a high speed memory will prevent the 
effective execution speeds from being achieved. 
Temporary solutions may be obtained by using a 
small amount of very fast memory in a multi-level 
memory. Further impediments to widespread use of 
high speed operation are the lack of inexpensive 
instrumentation equipment, the scarcity of techni- 
cal and professional manpower with training in 
this area, and the lack of production and testing 
experience . 

1. Fixed Point Add Times - The fastest fixed 
point add time (0.57 microsec) is currently 
credited to the University of Illinois 
ILLIAC-2. It might also be noted that after 
1950 all of the computers that claimed the 
title of "world's fastest adder" were binary 
machines. A computer with an add time of 
less than 0.1 microseconds will probably not 
be built before 1965, even though the hard- 
ware techniques are already available to 
build adders to operate at even faster speeds. 
Logic circuits and adders have been operated 
at clock rates of 50 Me, and even up to 

500 MCgUSing radio frequency carrier tech- 
niques , but the components and circuitry 
are currently so expensive and bulky that 
they will probably not be used as part of a 
complete computer system. 

2. Fixed Point Multiply Times - The fastest 
multiply time (1.4 microseconds) is credited 
to the IBM-STRETCH, or Los Alamos machine. 

As with the add times, the fastest machines 
are predominantly binary machines. A com- 
puter with a multiply time of less than 0.5 
microseconds will probably not be built 
before 1965. Prototype multipliers ha^e^ 
operated at 50 and 160 Me clock rates ’ , 
but these devices will probably not be incor- 
porated into a complete computer system for 
some time. 

3 . Floating Point Add Times - Figures 11 and 13 
illustrate the systems which are known to 
utilize floating point hardware (binary or 
BCD) . After 1958, more and more systems 
incorporated floating point arithmetic cir- 
cuitry, and the execution speeds increased 
in the same manner as for the fixed point 
operations . The fastest floating point add 
time (0.8 microsec) is currently credited to 
the IBM-STRETCH machine. Floating point 
arithmetic has proven to be a useful feature 
both for scientific and business computers. 
There will be an increasing need for 
floating point hardware in business computers 
as more users employ mathematical tools and 
operations research techniques for business 
applications . The instances where a single 
machine is used for both business and 
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scientific computations will also press the 
need for floating point hardware in any 
general-purpose machine at least as an 
optional feature. For future computers, it is 
expected that a greater percentage will in- 
corporate floating point hardware and the 
speeds will generally increase, but a computer 
with a floating point add time of less than 

0.1 microseconds will probably not be built 
before 1965. 

4. Floating Point Multiply Times - Figure 13 

shows the comparative floating point multiply 
speeds . The wide pattern with some increas- 
ing speeds is generally the same as for all 
the other execution speeds. The fastest 
floating point multiply time (1.4 microsec) 
is credited to the IBM-STRETCH machine. A 
computer with a floating point multiply time 
of less than 0.5 microseconds will probably 
not be built before 1965. 

Internal Memory 

1 . Number of Memory Speed Levels - In many com- 
puters the internal working memory consists 
of a combination of different types of memory 
devices with different speeds of access. For 
example, there are machine designs in which a 
small amount of core storage is backed up by 
additional drum storage, or a few high speed 
drum bands are backed up by a large number of 
slower bands. In many cases, the slower 
storage media is not directly addressable, and 
provisions are made to transfer data to and 
from the high speed storage in some modular 
quantity such as 20 or 100-word blocks. A 
homogenous, directly-addressable memory is 
desirable from a programmer's viewpoint, 
since the programs which use multi-level stor- 
age are necessarily more complex and less 
efficient than programs which utilize single- 
level storage. However, for large memories, 
the multi-level storage does provide a reason- 
able compromise between the expense and per- 
formance of high-speed and low-speed systems. 
Figure 15 illustrates the number of systems 
which have used 1-, 2-, 3-, or 4-level mem- 
ories . Single level systems are more numer- 
ous than any other type. For future machines, 
it is expected that the single level machines 
will continue to dominate the scene, but there 
will still be many multi-level systems. 

2 . Types of High Speed Memory - Figure 16 
describes the types of devices which have been 
used for the high speed memory. A definite 
pattern of emergence and de-emphasis can be 
noted for most of the devices . The various 
types of high speed memory — relay, vacuum 
tube, delay line, drum, cathode ray tube, 
magnetic core, disk, and diode-capacitor — 
made their operational appearance in that 
order . 

It would appear that the relay, vacuum tube, 
delay line, and cathode ray tube memories are 
on the way out. From now until 1965, the 


drum and core memories will be the devices 
which will appear most frequently as the 
high speed memory devices. There will be axjL Q 
increased application of thin magnetic films 
for memory applications in the immediate 
future and some systems with large (1000- 
5000 words) thin film memories should be 
operating before 1965. A small section (32 
words) of film memory has already 
with the Lincoln Lab TX-2 computer ’ ’ , 

but it has been used as part of the control 
circuitry and not as part of the addressable 
memory . 

3 . High Speed Memory Size - In order to examine 
and compare the memory sizes of the single- 
level along with the multi-level memory 
systems, it was decided to look at the size 
of the highest speed memory as well as the 
size of the largest possible total internal 
memory size. Figures 17 and 18 illustrate 
the size of the high speed memory, which is 
always less than or equal to the total 
internal memory size. Figure 17 distinguish- 
es between binary and non-binary memories, 
and Figure 18 distinguishes between the types 
of device used as the memory element . The 
maximum or extreme size of high speed memory 
is generally increasing. However, if the 
top and bottom extreme points are removed 
from either of these figures, the range of 
values actually shows very little change over 
the last decade. Another feature which is 
changing (see Fig. 18) is the type of device 
used for the high speed memory. This is in 
agreement with the data from the preceding 
section on types of high speed memory. 

Future computers will probably fall in the 
same range described in the figure, but 
there may be an occasional system which will 
show a larger high speed memory size, per- 
haps to 20 million bits. This figure is a 
little misleading since this size of memory 
could almost be achieved by a single magnetic 
drum. However, there will be an increasing 
number of systems with large, directly-addres- 
sable high speed memories . 

4. High Speed Memory Access Time - Figure 19 
shows the high speed memory access times, and 
distinguishes between the types of devices 
used as the memory element. This figure 
shows that with a few exceptions, the state- 
of-the-art of memory access time has not 
shown as rapid a development as. some of the 
other operating speeds. Access times on the 
order of 10 microseconds have been achieved 
with computers since 1950. The credit for 
the fastest memory access time currently be- 
longs to the University of Illinois ILLIAC-2, 
which achieves a 0.2 microsecond access time 
with a small section of high speed transistor 
flip-flop buffer memory. Most of the fast 
access times have been achjgved with magnetic 
cores. The probable limit to core switching 
speeds is 0.1 microseconds, and although the 
probably limit to thin film switching speeds 
is 0.01 microseconds, future computer systems 
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References 


will probably not achieve an access time of 
less than 0.05 microseconds before 1965. 

The higher memory speeds in the immediate 
future will probably be obtained with thin 
magnetic films, transistor flip-flops, and 
diode-capacitor memj^ies . Tunnel diodes have 
also been mentiongd as ^evices for high 
speed access (10 to 10 sec), but they will 
probably not be available in an operational 
computer before 1965. 

5. Total Memory Size - Figure 20 shows the total 
amount of internal memory which is available 
for each computer. There is a large spread 
of values, and a general increase in the maxi- 
mum sizes, but the average size of the memory 
has remained about the same for the last ten 
years. Future systems will probably fall in 
the same range described by the figure. There 
will be an increasing use of large, random- 
access devices such as the disk files or drum 
files. A computer system which utilizes 
photoscopic storage for high density permanent 
storage (such as the glass disk memory devel- 
oped for the Air Force machine translation 
projects) will probably be demonstrated before 
1965. 

Discussion 

An examination of many of the computer charac- 
teristics failed to disclose any significant 
trends. The number of bits or digits per instruc- 
tion or data word is an example of this lack of 
definite trend. For many characteristics, it 
appears that the majority of machines have utilized 
approximately the same state-of-the-art, or have 
copied each other to a large extent, and that a 
relatively small group of machines have provided 
the significant technical advances. It is also 
interesting to note that very few of the machines 
which were developed for commercial exploitation 
really provided any major advances in the charac- 
teristics studied in this paper. It would appear 9. 

that if there is an interest only in predicting 
the extensions of the state-of-the-art, then this 
can probably be accomplished primarily by looking 
at the small number of developmental and feasi- 
bility machines in addition to studying the pro- 
gress which is being reported on circuit and com- 
ponent development . 

The collected data seems to suggest that the 
majority of the computers developed between now 
and 1965 will show very little change in perfor- 
mance from that which was obtained during the last 
five years. However, a few research machines will 
definitely advance the technology, possibly as 
much as one order of magnitude for some of the 
characteristics, such as multiply times, memory 
sizes and memory access times. 
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Appendix A 

Definitions 

Computer 

This is the name of the computer model being des- 
cribed. If significant changes have been made in 
the original model, then the new computer config- 
uration is described as an additional entry (e.g., 
IBM 705 Mod. 1, IBM 705 Mod. 2, IBM 705 Mod. 3). 
Whenever possible, the manufacturer's or builder's 
name precedes the computer's common name. 

Operational Date 

This is the date on which the computer model was 
operating as a complete system. This may not be a 
clearly defined date, but represents an approxi- 
mation to the date at which a system with the 
described parameters was shown to be an actuality. 

Number of Addresses per Instruction 

This indicates the number of addresses which are 
included in a single instruction. This includes 
addresses of the next instruction, as well as 
addresses for the data being operated upon, and in 
some cases the addresses of addressable registers. 

Binary (decimal) Digits Per Instruction (data word ) 

This indicates the number of functional digits per 
instruction and/or data word. The sign digit is 
included in this count, but not parity digit. If 
the data word can have 2 or 3 different sizes, the 
maximum size is considered. 

Number of Index Registers 

This is the number of special ("B-box") registers 
which can perform automatic address modification 
to the command which is currently being inter- 
preted and executed. 

Clock Rate 

This is the internal pulse repetition rate or clock 
rate of the computer. 


This is the time required to execute an add 
command under optimum programming conditions. 

The time required for memory accesses are includ- 
ed in this number. Ten-digit operands were cho- 
sen for those machines which used a variable- 
length data word. If a range of speeds was given 
with no other explanation, an arithmetic mean of 
the range was chosen as the add time. It was 
assumed that no indirect addressing or address 
modification occurred at the time that the command 
was being executed . 

Multiply Time 

This is the time required to execute a multiply 
command under optimum programming conditions. 

The time required for memory accesses are includ- 
ed in this number. Ten-digit operands were cho- 
sen for those machines which used a variable- 
length data word. A word of "fives" was chosen 
for those cases where the execution time was a 
function of the value of the operand digits. If 
a range of speeds was given with no other explan- 
ation, an arithmetic mean of the range was chosen 
as the multiply time. It was assumed that no 
indirect addressing or address modification 
occurred at the time that the command was being 
executed . 

Floating Point Add (Multiply) Time 

This is the time required to execute a floating 
point command with floating point hardware. The 
execution times are not included for the machines 
which have to perform floating point arithmetic 
by subroutine. The general assumptions listed for 
the add and multiply times are also applicable 
here. If the floating point times are the same as 
the fixed point times, it is probably because the 
machine is basically a floating point machine and 
an entry was made in both the fixed and floating 
columns . 

Number of Speed Levels of Memory 

This is the number of different speeds of inter- 
nal memory which are present in a computer. This 
number does not include auxiliary storage (magne- 
tic tapes, drums, etc.) which is not essential to 
the operation of the basic machine. The amount 
and access time of the highest speed memory of 
each machine are described in separate tabulations. 

High Speed Access Time 

This is the time required to read a word (10 BCD 
characters if not otherwise specified) from the 
highest speed memory of the machine under optimum 
programming conditions. If a range of access 
times is quoted with no further explanation, then 
an arithmetic mean of the access time was used. 

Type of High Speed Memory 

This is the type of the highest speed memory used 
for each machine. 
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Total Memory 


This is the total amount of available memory (bits 
for binary machines, characters for BCD, bi-quinary, 
or alpha-numeric machines) for all levels of dir- 
ectly-addressable memory, exclusive of auxiliary 
storage (magnetic tapes, drums, etc.) 

Size of High Speed Memory 


This is the total amount of available memory with 
the fastest access time. 

General-Purpose, Stored-Program Computer 


This is a machine which has a stored program (by 
plugboard, pinboard, or internal memory), and can 
perform program iterations and modify its own 
stored commands. (A few machines which do not 
completely satisfy this requirement are included in 
this report because of their historical or techno- 
logical significance.) 



APPENDIX B 


9 

1.1 


AIL, INC. , MODAC 5014 
AIL, INC. , MODAC 404 
AIL, INC. , MODAC 410 
AIL, INC., MODAC 414 
AIR FORCE CAMBRIDGE ABC 
ALWAC 800 
ALWAC 3-E 

ARB. ELEK. RECH. PERM 
ARGONNE LAB. AVIDAC 
ARGONNE LAB. GEORGE 
ATOM. EN. RES. EST. HARWELL 
AUTONETICS RECOMP 1 
AUTONETICS RECOMP 2 
AUTONETICS VERDAN 
BELGIAN BELL IRSIA-FNRS 
BELGIAN BELL TEL. BELL 
BELL TEL. MARK-22 *MOD-4* 
BELL TEL. MODEL-6 
BELL TEL. MODEL- 5 
BELL TEL. LEPRECHAUN 
BENDIX G-15 
BENDIX G-20 
BIRBECK COLLEGE ARC 
BIRBECK COLLEGE SEC 
BIRBECK COLLEGE APERC 
BIRBECK COLLEGE APEXC 
BRITISH TAB. HEC-1 
BRITISH TAB. HEC-2 
BRITISH TAB. HEC-2M 
BRITISH TAB. HEC-4 *1201* 
BRITISH TAB. 555 
BRITISH TAB. HEC 1202 
BRITISH TAB. 1400 
BROOKHAVEN MERLIN 
BRUSH ELECT. ENG. CO. , LTD. 
BULL GAMMA-3 
BULL GAMMA 60 
BURROUGHS E101 
BURROUGHS 205 
BURROUGHS 220 
BURROUGHS LAB COMPUTER 
BURROUGHS UDEC-2 
BURROUGHS UDEC-1 
CAL INST. TECHNOL. MINAC 
CAMBRIDGE UNIV. EDSAC-1 
CAMBRIDGE UNIV. EDSAC-2 
CLARY DE-60 


COMP. 

RES. 

CORP. 

CADAC-102A 

COMP. 

RES. 

CORP. 

CRC-107 

COMP. 

RES. 

CORP. 

CADAC-102 


CONSOL. ENG. CORP. 36-101 
CONTROL DATA CORP. 1604 
CONTROL DATA CORP. 160 
CORBIN CORP. CORBIN 
C.S.I.R.O. MARK-1 
DANISH BOARD DASK 
DARMSTADT DERA 


DIG. 

EQUIP 

. CORP, 

, PDP 

REF. 

RES. 

TEL. EST. COMPUTER 

EIDGEN, TECHN, HOCH. R4S 

ELEC. 

LAB. 

TOKYO 

MARK-1 

ELEC. 

LAB. 

TOKYO 

MARK- 2 

ELEC. 

LAB. 

TOKYO 

MARK-3 

ELEC. 

LAB. 

TOKYO 

MARK- 4 

ELEC. 

LAB. 

TOKYO 

MUSASINO-1 


ELEC. LAB. TOKYO SENAC-1 
ELEC. LAB. TOKYO MARK-4A 
ELEC. LAB. TOKYO MARK-5 
ELLIOTT BROS. NICHOLAS 
ELLIOTT BROS. 403 
ELLIOTT BROS. 404 
ELLIOTT BROS. 402E, F 
ELLIOTT BROS. 402 
ELLIOTT NRDC 401 MARK-1 
E.M.I. ELECT. EMIDEC-1100 
E.M.I. ELECT. EMIDEC-2400 
ENG. ELEC. CO. DEUCE MK-1 
ENG. ELEC. CO. DEUCE MK-2 
ENG. ELEC. CO. DEUCE MK-2A 
ENG. ELEC. CO. KDP-10 
ERA LOGISTICS 
FACIT INC. EDB 
FERRANTI ARGUS 
FERRANTI ATLAS 
FERRANTI MANCHESTER MARK-1 
FERRANTI MARK 1* 

FERRANTI MERCURY (MARK 2) 
FERRANTI ORION 
FERRANTI PEGASUS-1 
FERRANTI PEGASUS-2 
FERRANTI SIRIUS 
FERRANTI PERSEUS 
FLAC 

FORD INST. DATAKEEPER- 1000 
FUJI PHOTO FILM CO. FUJIC 
FUJI COMMUN. FACOM-138 
FUJI COMMUN. FACOM-1288 
FUJI COMMUN. PC-2 
FUJI COMMUN. FACOM-212 
FUJI COMMUN. FACOM-222 
GENERAL ELECTRIC 150 
GENERAL ELECTRIC 210 
GENERAL ELECTRIC 312 
GENERAL ELECTRIC 100 
GENERAL ELECTRIC OMIBAC 
GENERAL ELECTRIC OARAC 
GOETTINGEN G-3 
GOETTINGEN G-l 
GOETTINGEN G-2 
GOETTINGEN G-2 
HARVARD MK 2 *DAHLGREEN* 
HARVARD MARK-3 *ADEC* 
HARVARD MARK-4 
HASLER ERMETH 
HITACHI LTD. HIPAC-1 
HITACHI LTD. HITAC-102 
HITACHI LTD. HIPAC-101 
HITACHI LTD. HITAC-301 
HITACHI LTD. HITAC-1 
HOGAN LABS CIRCLE COMP. 
HOKUSHIN ELECT. H-l 
HONEYWELL 800 
HONEYWELL DATAMATIC 1000 
HUGHES DIGITAC *AIRB* 
HUGHES DIGITAIR *AIRB* 

IBM 604 

IBM 607 

IBM 608 

IBM 305 RAMAC 

IBM 650 MOD. 1 
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IBM 650 MOD. 2 
IBM 650 MOD. 4 
IBM 701 *CRT MEMORY* 

IBM 701 *MC MEMORY* 

IBM 702 MOD. 1 
IBM 702 MOD. 2 
IBM 704 

IBM 705 MOD. 1 
IBM 705 MOD. 2 
IBM 705 MOD. 3 
IBM 709 
IBM 7080 
IBM 7070 
IBM 7090 
IBM 1401 
IBM 1620 

IBM STRETCH *LOS ALAMOS* 
IBM AN/FSQ-7 *SAGE* 

IBM AN/FSQ 7A *SAGE* 

IBM NORC 

IBM DINABOC *AIRBORNE* 

IBM COMAR- 1 610 PROT 
IBM ASEC *HARVARD MK.l* 

IBM DATA COORD/RETRIEVER 
INFO. SYSTEMS 609 
IN. P.M. RECHENAUTOMAT- I PM 
JACOBS INSTR. JAINCOMP-A 
JACOBS INSTR. JAINCOMP-B 
JACOBS INSTR. JAINCOMP-C 
JAPAN TEL. & TEL. M-l 
KIDRICH INST. COMPUTER 
LEEDS & NORTHRUP LN-3000 
LEO COMPUTERS LTD. LEO-1 
LEO COMPUTERS LTD. LEO-2 
LEO COMPUTERS LTD. LEO-2C 
LFE DIANA 

LIBRASCOPE ASN-24 *AIRB* 

LI BRA SCOPE LGP-30 
LIBRASCOPE LIBRATROL-500 
LIBRASCOPE LIBRATROL-IOOO 
LIBRASCOPE RPC-4010 
LIBRASCOPE FAA-ARTC COMP. 
LIBRASCOPE ASROC COMP. 
LIBRASCOPE CP- 20 9 
LINCOLN LAB TX-0 
LINCOLN LAB MEMORY TEST 
LINCOLN LAB TX-2 
LINCOLN LAB CG-24 
LINCOLN LAB TX-3 
LOS ALAMOS MANIAC- 1 
LOS ALAMOS MANIAC-2 
MAGNAVOX MAC-3 
MARCHANT MINIAC C,2 
MATH CENTRUM ARMAC 
MATH CENTRUM ARRA 
MATSUSHITA COMM. DM- 800 1A 
MELLON INST. COMPUTER 
METR. VICK. METROVICK-950 
METR. VICK. A.E.I. 1010 
M.I.T. WHIRLWIND-1 
MINISTRY OF SUPPLY MOSAIC 
MIT WHIRLWIND-1 *MC MOD.* 
MONROE MONROBOT-3 
MONROE MONROBOT- 5 
MONROE MONROBOT-6 
MONROE MONROBOT-MU 
MOORE SCHOOL EDVAC 


MOORE SCHOOL ENIAC 
MOORE SCHOOL MSAC 
NAREC 

NATIONAL- ELLIOTT 802 
NATIONAL- ELLIOTT 40 5M 
NATIONAL- ELLIOTT 405 
NAT. PHYS. LAB. PILOT ACE-1 
NAT. PHYS. LAB. PILOT ACE-2 
NAT. PHYS. LAB. ACE 
NAVAL RES. LAB NAREC 
NBS SWAC 

NBS SEAC *DL MEMORY* 

NBS SEAC *DL & CRT* 

NBS DYSEAC 

NBS PILOT *PRIMARY COMP.* 

NBS PILOT *SECOND. COMP.* 

NCR 10 2D 
NCR 304 
NCR 10 2A 

NEPA/USAF FAIRCHILD COMP. 

NIPPON ELECT. NEAC-1101 
NIPPON ELECT. NEAC-1103 
NIPPON ELECT. NEAC-2201 
NIPPON ELECT. NEAC-1102 
NIPPON ELECT. NEAC-2203 
NO. AMER. NATPAC 
N.V. ELECTROLOGICA XI 
OAK RIDGE ORACLE 
OKI ELECT. OPC-1 
OLIVETTI PISA 
ONR RELAY COMPUTER 

PACKARD BELL PROCESS CONTROL COMP. 3/60 

PENN ST. UNIV. PENNS I AC 

PHILCO COMPAC 

PHILCO BASIPAC 

PHILCO C-1102 

PHILCO S-1000 *SOLO* 

PHILCO S-2000 
POWERS-SAMAS PCC 
PRAGUE SAPO 

PRINCETON INST. AD. ST. IAS 
PTT PTERA 

RAND CORP. JOHNNIAC 
RAYTHEON RAYCOM 
RAYTHEON RAYDAC 
RCA 110 *PROCESS CONTROL* 

RCA 502 
RCA 503 
RCA BIZMAC 2 
RCA 501 
RCA BIZMAC 1 
RCA 504 

REA CO. READIX 
REM-RAND UNIVAC 1102 
REM-RAND UNIVAC-1 
REM-RAND UNIVAC 1101 
REM-RAND BINAC 
REM-RAND UNIVAC-2 
REM-RAND UNIVAC-3 
REM-RAND UNIVAC-1105 
REM-RAND FILE COMP. MOD. 0 
REM-RAND FILE COMP. MOD. 1 
REM-RAND TRANSTEC-2 
REM-RAND SS-80 
REM-RAND SS-90 
REM-RAND ATHENA 
REM-RAND UNIVAC 1103 



REM-RAND AFCRC MAG. COMP. 
REM-RAND X-308 *BOGART* 
REM-RAND LARC 

REM-RAND M-460 *COUNTESS* 5/58 
RICE INSTITUTE COMPUTER 
RPC- 4000 

RW-40 *AN/FSQ-27* 

RW-300 

RW-400 

RW-30 *AIRBORNE* 

SAAB CO. SARA 
SHIBAURA ELECT. TAC 
SHIBAURA ELECT. TOSBAC-3 
SIEMENS 2002 

SOC. DELECT. AUTO. CAB- 500 
SOC. DELECT. CAB-2000 

SOC. NOUV. DELECT. KL-901 
SOC. DELECT. CUBA 
SOC. DELECT. AUT. CAB-3000 
SOC. DELECT. AUT. CAB-3030 
STAND. TEL. STANTEC ZEBRA 
STANDARD ELECTRIC STANLEC 
STAND. ELEK. LORENZ ER -56 
SWEDISH BOARD BARK 
SWEDISH BOARD BESK 
SYLVANIA MOBIDIC 
SYLVANIA UDOFT 
TBA-1 

TECHNITROL 180 

TEL. RES. EST. TRE COMPUTER 

TELEFUNKEN TR-4 

TOKYO SHIBAURA CO. TAC 

TOKYO UNIV. PC-1 

TOKYO UNIV. MUSASINO-1 

UNDERWOOD ELECOM 100 

UNDERWOOD ELECOM 120 

UNDERWOOD ELECOM 125 

UNDERWOOD ELECOM 200 

UNIV. OF CALIF. CALDIC 

UNIV. OF LUND SMIL 

UNIV. OF TORONTO UTEC 

UNIV. OF ILL. ORDVAC 

UNIV. OF ILL. ILLIAC-1 

UNIV. OF ILL. ILLIAC-2 

UNIV. OF MICH. MIDAC 

UNIV. OF WISC. WISC 

UNIV. OF MICH. MIDSAC 

USSR ARAGATZ 

USSR EREVAN 

USSR M-3 

USSR VOLGA 

USSR MESM 

USSR RAZDAN 

USSR STRELA *ARROW* 

USSR BESM-1 

USSR M-2 

USSR URAL 

USSR BESM-2 

USSR KIEV 

USSR LEM-1 

USSR SETUN 

VIENNA TECH URRI 

VIENNA TECH. MAILUFTERL 

WEGEMATIC 1000 

WEIZMAN INST. WEI ZAC 

WESTINGHOUSE NORDIC-2 

WHARF ENG. LAB. MAC 


WARF ENG. LAB. M-2 

WILLOW RUN RES. CEN. MIDAC 

ZEISS RELAY COMP. OPREMA 

ZEISS ZRA-1 

ZUSE CO. Z-4 

ZUSE CO. Z-ll 

ZUSE CO. Z-5 

ZUSE CO. Z-22 




FIG. 2 TOTAL NUMBER OF DIFFERENT TYPES OF COMPUTERS DEVELOPED TO DATE 
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NUMBER OF INDEX REGISTERS PER MACHINE 
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FIG. 8 


NUMBER OF BINARY BITS PER DATA WORD 
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FIXED POINT MULTIPLY TIME - BY COMPUTER TYPE 
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HIGH SPEED MEMORY SIZE - BY CODE 
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THE HARVEST SYSTEM 


Paul S. Herwitz and James H. Pomerene 

Product Development Laboratory, Data Systems Division 
International Business Machines Corporation 
Poughkeepsie, New York 


Summary 


The Harvest System is a large-scale data 
processor designed for maximum performance 
in handling extremely large amounts of data in 
primarily non-arithmetic operations. It is 
being built by IBM under contract with the 
Government and incorporates both a new 
system organization and a stored program 
concept of macro-instructions which directly 
implement many useful data manipulating sub- 
routines. Design features include a very high 
processing rate and an on-line table lookup 
facility for effecting very general transforma- 
tions. 

Introduction 

Most of the information generated and 
handled in our society is either non-numeric 
or developed within areas supported by little 
or no theoretical structure. A general-purpose 
system for processing this type of information 
must be organized very differently from more 
conventional scientific computers. Many of 
these areas are in the early stages of scien- 
tific development where ordering and classifi- 
cation are predominant activities and where 
the major problem is to uncover patterns and 
trends upon which theory can be built. There 
are few rules for determining the relevancy 
of information so that often enormous amounts 
must be examined to achieve significant 
results. Comparatively simple operations and 
data transformations are the most useful, and 
the output desired may frequently be some 
statistical characterization of the input. 

The Harvest System is a large scale com- 
puter intended for maximum performance in 
this broad area of information processing. It 
was defined and is being built under study and 


development contracts between IBM and the 
Government. Harvest comprises a major data 
processing system reflecting the above con- 
siderations and includes an IBM Stretch com- 
puter for high performance on conventional 
operations (Figure 1). Stretch has been 
described elsewhere; only the special data 
processing portion is discussed here. 


Harvest Organization 
The Streaming Mode 

Processing in Harvest is parallel by 
character, represented by a quantity of 8 bits 
or less. This quantity is called a byte and is 
the basic information unit of the system. The 
streaming mode is primarily a design attitude 
whose aim is to select bytes from memory 
according to some preassigned pattern and to 
keep a steady stream of such selected bytes 
flowing through a designated process or trans- 
formation and thence back to memory. Empha- 
sis is on maximum flow rate so that the typical 
large volumes of information can be processed 
in minimum times. Processing time per byte 
is held to a minimum by pre specifying byte 
selection rules, processing paths, and even 
methods for handling exception cases, so that 
decision delays are suffered but once for a long 
sequence of bytes rather than being compounded 
for each byte. There is a functional analogy to 
plugboard machines except that the plugging can 
be changed at high electronic speeds and much 
of it on the basis of data encountered. 

Stream Formation 

The bytes which are selected to form the 
stream are taken from memory according to 
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either simple or complicated patterns which 
are chosen by the programmer. For technical 
reasons memory is organized into 64-bit words, 
but this artificial grouping is suppressed in 
Harvest so that memory is handled as a long 
string of bits any one of which can be addressed 
for selection. Up to 2^® words of memory can 
be directly selected, and since the word size 
is exactly 2^ a Harvest address consists of 24 
bits: 18 to select the word and 6 to select the 
bit within the word. 

Data are transferred to and from memory 
as 64 parallel bits; selection down to the bit 
is accomplished by generalized operand 
registers called stream units (Figure 2). 

Each stream unit contains also a switching 
matrix which allows a byte to be selected out 
with minimum delay, starting at any bit 
position within the register. To handle cases 
where a byte overlaps from one memory word 
into the next and to minimize waiting time for 
the next needed word from memory, each 
stream unit is actually 2 words (i. e. , 128 bits) 
long. The byte output of the stream unit is 
fed into the processing area through a bit-for- 
bit mask which enables the programmer to 
pass any subset of the 8 bits, including non- 
consecutive combinations. The selection of 
these bytes is controlledby the low order 6 
bits of a stream of 24-bit addresses generated 
by the pattern selection units. There are two 
source stream units which feed operands into 
the processing area and one sink stream unit 
which accepts results from the processing 
area. 

Pattern Selection 

The data input to Harvest may be highly 
redundant to any particular problem, and so a 
powerful mechanism is provided for imposing 
selection patterns on the data in memory. It 
is assumed that the very effective input- output 
control in the basic Stretch system has grossly 
organized the contents of memory. For 
example, various characteristics may be 
recorded for a population and recorded in 
uniform subdivisions of a file. A particular 
problem may be concerned with only a certain 
characteristic drawn from each record in the 
file. Again, for example, data may be stored 
in memory in matrix form and the particular 
problem may require the transpose of the 
matrix. 

Pattern selection in Harvest resembles 
indexing in other computers, except that in 
Harvest the programmer determines the 
algorithm which generates the pattern rather 


than listing the pattern itself. Each stream 
unit has its independent pattern generating 
mechanism which is actually an arithmetic 
unit capable of performing addition, subtrac- 
tion, and counting operations on the 24-bit 
addresses. The programmer specifies 
patterns in terms of indexing levels. Each 
level consists of an address incrementing 
value I which is successively added to the 
starting address value S until N such incre- 
ments have been applied, after which the next 
indexing level is consulted to apply a different 
increment. The programmer may then 
choose that incrementing continue on this 
level or that the previous level be resumed for 
another cycle of incrementing. 

Many other indexing modes are provided 
to permit almost any pattern of data selection. 
Particular attention has been given to direct 
implementation of triangular matrix selection 
and to the iterative chains of any formal 
inductive process, however complex. 

In general, the pattern selection facilities 
completely divorce the function of operand 
designation from that of operand processing, 
except that pre-designated special character- 
istics of the operands may be permitted to 
change the selection pattern in some fashion. 

Processing Facilities 

The pattern selection units determine the 
movement of data between the stream units and 
memory and, together with the stream units, 
determine the byte flow in the processing area. 
The processing facilities, together with the 
selection facilities, have been designed to 
give a flow rate of approximately 4 million 
bytes per second. Source stream units to the 
processing area are stream units P and Q 
while the sink stream unit is stream unit R. 

Transformation Facilities 

Two facilities are provided for the trans- 
formation of data (Figure 3). Extremely 
general operations on one or two input vari- 
ables can be accomplished with the on-line 
table lookup facility . Simpler operations can 
be done directly by the logic unit without 
involving memory lookup. The logic unit also 
provides a choice of several one-bit character- 
izations of the input bytes (such as Byte from 
P > byte from Q). These one-bit signals can 
be used to alter the stream process through 
the adjustment mechanism. 

The table lookup facility consists of two 



units. The more important logically is the 
Table Address Assembler which accepts bytes 
from one or two sources and from them forms 
the lookup addresses which are sent to memory 
(Figure 4). The other is the Table Extract 
Unit which permits selection of a particular 
field within the looked-up word. Both units 
have their own indexing mechanisms and 
together they permit the programmer to 
address a table entry ranging in size from one 
bit up to a full word and starting at any bit 
position in memory. This freedom is 
abridged only by considerations of the table 
structure chosen by the programmer. 

The table lookup facility also provides 
access to the memory features of Count and 
Existence. Under instruction from the Table 
Address Assembler the medium speed memory 
can use the assembled address to logically OR 
a one into the referenced bit position. The 
referenced word as it was just before the 
ORing can be sent to the table extract unit. 

In the high-speed memory a one may be either 
ORed or added into the referenced bit position 
with the same provision for sending the word 
before alteration to the table extract unit. 

The ability to add ones into high-speed memory 
words permits use of these words as individual 
counters. Several counter sizes can be 
specified. 

Statistical Aids 

The table lookup facility may be used to 
associate statistical weights with the occur- 
rence of particular sets of bytes. For 
example, the occurrence of a byte in the P 
stream together with a byte Qj in the Q 
stream may be assigned a weight Wjj, which 
would be stored in a table and referenced by 
an address formed from both Pj and Qj. 
Alternatively a memory counter may be 
associated with each pair P^, Qj and stepped 
on the occurrence of each such pair. 

A Statistical Accumulator (SACC) is 
provided (Figure 5) either to sum the weights 
W over a succession of sets of bytes or to 
provide a key statistical measure of the 
counting results. In addition, SACC can be 
used for many other accumulating purposes. 

A Statistical Counter (SCTR) provides a 
way of counting the occurrences of any of a 
large number of events during a stream. In 
particular, SCTR can be designated to count 
the number of weights W which have been 
added into SACC. 


The Stream Byte-by-Byte Instruction 

The table lookup unit, the logic unit, and 
the statistical units can be connected into the 
processing stream in various ways by the 
programmer. Like a class of analog com- 
puters, these connections reflect the structure 
of a problem and are the electronic equiva- 
lents of a plugboard. The hookup chosen by 
the programmer then applies the same pro- 
cessing to each byte or pair of bytes sent 
through it; this very general processing mode 
is called the Stream Byte-by-Byte instruction. 
The connections, indexing patterns, and 
special conditions described below all form 
part of a pre-specified setup which can be 
considered as a macro-instruction putting the 
computer into a specific posture for a specific 
problem. 

Monitoring for Special Conditions 

The concept of a streaming process 
determined by a flexible and extensive setup 
specification is most meaningful when applied 
to a large batch of data which is all to be 
treated the same way. In any particular 
streaming process, special conditions may 
arise within the data being processed which 
call for either momentary or permanent 
changes in the process. For example, the 
transformation being performed may be 
undefined for certain characters so that these 
must be deleted at the input; or a special 
character may be reserved to mark the end 
of a related succession of bytes after which 
the process or the pattern of data selection 
must be altered. 

Special conditions can be monitored in 
several ways (Figure 5). Special characters 
can be detected by Match Units, each of which 
can be assigned a special 8-bit byte to match. 
There are four Match Units; W, X, Y, and Z, 
which can be connected to monitor the stream 
at several different points. When a match 
occurs, the Match Unit can directly do one of 
several operations on the stream and can also 
emit a one -bit signal indicating the match. 

A large number of other one-bit signals 
are generated by the various stream facilities 
to mark key points in their respective pro- 
cesses. These one-bit signals, collectively 
called stimuli, can be monitored to accomplish 
specific operations, such as stepping SCTR or 
marking the end of an indexing pattern. They 
can also be used to accomplish a much wider 
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range of operations through the adjustment 
mechanism. 

Up to 64 stimuli are generated by the 
various processing, indexing, and monitoring 
functions in Harvest. For any particular 
problem those stimuli which represent the 
key or significant properties of the data being 
streamed can be chosen. To each stimulus or 
coincident combination of stimuli the pro- 
grammer may associate one or more of a 
large number of reactions on the stream; its 
data, its process, or its indexing. These 
stimulus -reaction pairs are called adjustments. 
The adjustment mechanism gives the pro- 
grammer a direct way of picking out those 
elements of the stream which are different 
from the general run. These different 
elements may provide the key to the pattern 
being sought, either because they are particu- 
larly relevant or because they are distinctly 
irrelevant. 

Programming Harvest 
The Instruction Set 

Conventional arithmetic and scientific 
computational processes and all input-output 
operations for Harvest are performed in the 
Stretch computer which is part of the system. 
When Stretch instructions are used, the 
system operates in the arithmetic mode; when 
streaming mode control is imposed, the 
unusual instructions unique to Harvest are 
available, There are about - 85 instruction 
families (i. e. , instruction types plus associated 
operation modifiers) in Stretch alone. The 
Harvest stream instructions add a variety of 
extremely powerful data processing tools to 
the several thousand basic Stretch operations. 
Throughout the system the instruction formats 
vary in length: single-address instructions are 
32 bits long, two-address instructions and 
instructions that operate on variable length 
fields are 64 bits long, and stream instructions 
have an effective length of 192 bits. 

Streaming mode instructions are very 
much like built-in subroutines or macro- 
instructions. Just as it is necessary to 
initialize a programmed subroutine, it is also 
necessary to initialize or set up the Harvest 
processor. Roughly about 150 parameters and 
control bits may influence streaming. Harvest 
is set up by loading values of some of these 
parameters into and setting the desired control 
bits in specific, addressable setup registers 
prior to the execution of a stream instruction. 
Certain changes in the parameter values or 


control bit settings generate stimuli which 
may be used to terminate or cause automatic 
adjustments to be made to the stream, or to 
cause a change to the arithmetic mode of 
operation. The adjustment operations 
essentially constitute a second level of stored 
program and are used most generally to 
handle the exception cases that occur during 
processing operations. Thus the programmer 
sets up Harvest to execute a stream instruc- 
tion; execution begins and is automatically 
modified as changing data or setup dictates; 
much routine bookkeeping is done automati- 
cally by the several independent pattern 
generating (indexing) mechanisms; changing 
values of parameters are always available for 
programmed inspection if automatic inspec- 
tion is not sufficient for the particular opera- 
tion being performed. 

While most of the programming in the 
streaming mode of operation is centered 
around the Stream-Byte-by-Byte instruction, 
a number of other instructions derive from 
the unique organization of Harvest. The 
arrangement of the Harvest data paths and pro- 
cessing units facilitates the inclusion of opera- 
tions that perform within one instruction each 
many of the routine collating functions such as 
merging, sorting, and file searching and main- 
tenance that are so common to commercial data 
processing. Facilities of the table lookup unit 
are used extensively in these as well as in 
several other instructions designed primarily 
for the logical manipulation of data. 

Since such extensive use is made of tables 
of parameters, table transformations, and 
other data arrays in the Harvest approach to 
data processing, a special Clear Memory 
instruction is available for clearing large 
blocks of memory in minimum time and with 
minimum programming effort. A single 
execution of this instruction will clear as few 
as 64 consecutive words or as many as 2048, 
depending on the programmer's wishes. The 
execution time in the latter case is less than 
3-1/2 (isec, and only one instruction access 
from memory is made. A full memory com- 
plement of 2*8 words can be cleared in less 
than one millisecond. 

Collating Operations 

Generally speaking, in order to perform 
merging, file searching, and other such 
collating operations, it is necessary to specify 
a number of parameters such as record length 
file length, control field length and position, 
etc. For Harvest, these parameters need 
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only be tabulated in proper order by the pro- 
grammer. They are then utilized by the 
indexing mechanisms to cause data to be 
picked from and later be stored into memory 
according to the patterns that naturally occur 
in such data. 

The Merge instruction family actually 
contains eight independent control sequences 
that may be used to merge files or even to 
completely sort blocks of records with but a 
single instruction access from memory. The 
particular option chosen by the programmer 
depends upon whether files are to be arranged 
in ascending or descending order, whether or 
not the record block can be contained in at 
most half the available memory, and whether 
the control field heads the record or is offset. 

As an indication of the processing speed of 
Harvest, in the most favorable case (one-word 
records with control fields at the beginning of 
records) a block of 30,000 records already in 
memory can be completely sorted in 1-1/4 
seconds or less. 

The Search instruction complex consists of 
twelve control sequences, each of which facili- 
tates abstracting from a master file all records 
whose control fields bear any specific one of six 
possible relationships to the control field of each 
record of a detail file. The possible relation- 
ships are the six standard comparison conditions 
< , t=. , ^ ^ , =, 4 . If it is not desired to move 

the records that meet the search condition, it is 
possible to tabulate their addresses automatically. 

Another complex called Select permits the 
programmer to select from a file that record 
having the least or greatest control field. 

For the purpose of facilitating file main- 
tenance operations, Harvest includes a collating 
instruction complex called Take-Insert-Replace. 
When the operation is executed under "instruc- 
tion control, " then whenever master and detail 
record control fields match, either the master 
record is taken out of (deleted from) the master 
file or is replaced by the detail record. Under 
"data control" the action taken whenever control 
fields match is indicated by the contents of a 
special control byte in the detail record. The 
masters can be deleted or replaced, or the 
detail record can be inserted in the master file; 
or under certain circumstances the maintenance 
procedure can be interrupted when master 
records with special characteristics are located, 
and resumed with a minimum of programming 
effort when desired. 


Instructions such as the collating operations 
described above lead to a considerable reduction 
in the length of the generalized report genera- 
tors, file maintenance routines, sorting and 
merging programs, etc. , that might be expected 
to become associated with such a computer 
system. 

Table Lookup Operation s 

It is often desired to be able to obtain data 
from or store data at an address that is not 
directly dependent on the data itself. The 
Indirect Load- Store instruction complex permits 
wide latitude in the formation of such addresses 
and in the subsequent manipulation of the 
original data. In essence the operation is as 
follows: parameters from one of the source 
stream units are used in the formation of an 
address in the table lookup unit; either this 
primary address itself or either of the addresses 
found in the word at the memory location 
specified by the primary address becomes 
either the origin of a field of data to be entered 
via the other source stream unit or the location 
at which the data field is to be stored by the 
sink stream unit; the data is moved from source 
to sink and the entire cycle is repeated. The 
counting and ORing features of the table lookup 
unit are available to the programmer as modi- 
fications of the basic instruction control 
sequence. 

The second instruction complex built around 
the table lookup unit is called Sequential Table 
Lookup. An extremely powerful but concep- 
tually simple instruction, it permits a class of 
transformations to be performed that may best 
be described as data dependent. The instruc- 
tion causes a series of table references to be 
made, each successive reference after the first 
being made in a table whose address is extract- 
ed automatically from the previously refer- 
enced table entry. Also, as each reference is 
completed, a variable amount of data may be 
extracted from the table entry. Moreover, the 
indexing of the input or output data may be 
adjusted according to the contents of the table 
entry (similar to the operation of the Turing 
machine). The applications of Sequential Table 
Lookup are manifold; the editing for printing 
of numerical data, the transliteration of Roman 
numerals to Arabic, and the scanning of sym- 
bolic computer instructions for assembly and 
compilation purposes are but a few. 

The extensive use of tables in problem 
solution typifies the different approach the 
programmer will take with Harvest. The 
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problem of transliteration of Roman numerals 
to Arabic illustrates the power of the method. 
Several simplifying assumptions have been made 
so that the flow chart is easier to follow. First, 
the data - a set of numbers expressed in Roman 
numerals, each number separated from the next 
by a blank (B) - is assumed to be perfect, and 
only the characters I, V, X, L, C, D, and M 
are used. Second, the set of numbers is ter- 
minated by two blanks, Third, the use of four 
successive identical characters (as Roman IHI 
for Arabic 4) is outlawed. Finally, the numbers 
to be transformed are all assumed to lie on the 
range 1 to 1000 , inclusive. 

The flow chart ( Figure 6 ) shows the eight- 
een tables ( consisting of a total of eighty-two 
memory words ) used. Under each table 
heading a two-part entry is shown, the parts 
separated by a colon. On the left of the colon 
is the argument being looked up, followed in 
parentheses by an indication of the range on 
which the number or digit that will eventually 
result must lie. On the right of the colon the 
parameters of the table word corresponding to 
the argument are indicated symbolically: e. g. , 
RO-1B (meaning "read out the integer 1 followed 
by the character for blank ”) or NRO (meaning 
"no readout"). This is followed by an integer in 
parentheses indicating what data byte is the next 
argument (0 means same byte, 1 means next 
byte, etc. ); the arrow indicates the table in which 
the next argument is looked up. 

As an illustration, consider the trans- 
literation of DCLXXVIII : 

1. D is looked up in the First Table ; the 
number must be on the range 500 through 899. 

No digit is read out. The next argument is the 
next data byte. 

2. C is looked up in the Table; the 
range must be 600 through 899. No readout. 

The next argument is the next data byte. 

3. L is looked up in the DC^ Table; the 


range is 650 through 689. Read out 6 . The 
next argument is the next data byte. 

4. X is looked up in the L^ Table; the 
range of the unknown part of the number is 60 
through 89. No readout. The next argument 
is the next data byte. 

5. X is looked up in the LX^ Table; the 
range is reduced to 70 through 89. No readout. 
The next argument is the next data byte. 

6 . V is looked up in the LX., Table; the 
range is now 75 through 79. Read out 7. The 
next argument is the next byte. 

7. I is looked up in the V Table; the range 
of the final digit is 6 through 8 . No readout. 

The next argument is the next data byte. 

8 . I is looked up in the Table; the 
final digit is 7 or 8 . No readout. The next 
argument is the next byte. 

9. I is looked up in the V 3 Table; the final 
digit is 8 . Read out 8 B. The next argument is 
the second following byte (the next byte is a B), 
i. e. , the first byte of the next number to be 
transliterated, and is looked up in the First 
Table . 

The process just described yielded the 
number 678 for DCLXXVIII. Only one instruc- 
tion was accessed from memory (the Sequential 
Table Lookup) and, in fact, this single access 
served to transform the entire set of numbers . 
The process terminated when the character B 
was looked up in the First Table. 

Clearly the decision logic for the problem 
is incorporated in the structure of the tables. 
However, in constructing these tables the 
programmer must concentrate on precisely 
this logic; most of the bookkeeping and other 
peripheral programming considerations are 
automatically taken care of. Wherever it was 
possible, this philosophy guided the systems 
planning of Harvest. 
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MEDIUM SPEED MEMORIES 

16,384 WORDS EACH HIGH SPEED MEMORIES 



EXCHANGE ARITHMETIC SECTION STREAMING MODE SECTION 

(STRETCH) 

Fig. 1. The Harvest System. 



Fig. 2. The stream unit. 
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Fig. 5. Monitoring and statistical features with typical adjustment reactions. 
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FIRST TABLE 
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2(l5SnSl8) RO- 1(1) 

X (20sns 39).NR0 (I ) 


First Table 
• X, Table 
■ 3Z| Table 
X2 Table 


B(n=20). R0-20B(I) 

I(2lsns24 or n*29V R0-2(l)- 

2(25sns28): R0-2(l) 

X(30sns39) R0-3(l) 


B(n=7) R0-7BU) - 
I(n=8) R0-8B(2) - 


First Table 

■ 1 1 Table 

■ 37) Table 

• Ones Table 


B(n=0) RO-OB(I) 

I(lsns4 or n=9). NRO(I)- 
I (5sns8) NRO (l)< 


* First Table 

* I| Table 

* 3E|Table 


L (40s n$ 49) R0-4(l)- 
C (90S ns 99) R0-4(l)- 


L(50SnS89) NRO(I) - 


B(n=50). R0-50B(I) 

I(5lSnS54 or n»59). R0-5(l)- 

2(55sns58): R0-5(l) 

X(60sns89). NRO(I) 


First Table 

1, Table 

2, Table 
LX, Table 


B(n=60): R0-60B(I) 

I(6lsns64 or n=69)‘ R0-6(l)- 

2(65sns68). R0-60) 

X(70sns89) NRO(I) 


• First Table 

• 1 1 Table 
2, Table 

• LX 2 Table 


B(n=70). R0-70B(I) 

I(7lsns74 or n=79) R0-7(l)- 

37(75SnS78) R0-7(l) 

X(80sns89)’ R0-8(l) 


• First Table 
■ 1 1 Table 

2, Table 

• Ones Table 


C(t00sns499 or 900SnS999) NRO(l)- 


B(n=IOO) RO-IOOB(l) 

I(l0ISnSI04 or n=l09) RO-IO(I) - 
37(105 Sn< 108) RO-IO(I) 


X(l!0<nSI49orl90SnSl99) RO-l(l)- 

L(l50SnSl89)' RO — 1(1) 

C(200SnS 399)' NRO(I) 


■ First Table 

■ 1 1 Table 

• 2) Table 

• X 1 Table 

• L, Table 

• C2Table 

B(n=200). R0-200B(I) - 


D(400SnS499) R0-4(U- 
M(900SnS999) R0-9(l)- 


I (201 S n S 204 or n=209). R0-20(l) 

2(205 SnS 208) R0-20(l) 

X (210 5 n S 249 or 290 S n S 299) R0-2(l)- 

L(250 S n S 289). R0-2(l) — 

0(300 S n S 399): R0-3(l) 


■ Tens Table 
Tens Table 


► First Table 

- I) Table 

► 2, Table 

- X, Table 

► L| Table 

► Tens Table 


B(n=00) RO-OOB(I) 

1(01 S n S 04 or n=09). RO-O(I) - 
2(05 S n S 08). RO— 0(1) 


X(IOSnS49or90£nS99). NRO(I) - 
L(50£n£89) NRO(I) 


' First Toble 
' I) Table 

► 2, Table 
• X, Table 

► L| Table 


D(500SnS899). NRO(D- 


— ► First Table 
— ► I, Table 
— ► 2, Table 

X(5lO£nS549or 590SnS599)R0-5(l) ► X,Table 

L(550SnS589):RO— 5(1) ► L, Toble 

C(600SnS899) NRO(I) ► DC|Table 


B(n = 500). RO— 500B(I) 

I(50ISnS504 or n=509) RO-5O0)- 
2(505 SnS508) R0-50(l) 


B(n=600) R0-600B(I) 

I(60ISnS604 or n = 609).RO-60(l) — 

2(605SnS608): R0-60(l) 

X(6K)SnS649 or 690SnS699) R0-6(l)- 

L(650Sns689).R0-6(l) 

C(700SnS899) NRO(I) 


► First Table 

► I, Table 

► 2, Table 

► X| Table 

► L| Table 

► DC2Table 


B(n= 700): RO-7OOB0)- 


First Table 
I, Table 
2) Table 

X(7K)SnS749 or 790SnS799) R0-7(l) — ► X|Toble 

L(750SnS789) R0-7(l) ► L, Table 

C(800s nS899)- RO— 8(1) ► Tens Table 


I(70ISnS704 or n=709):R0-70(l)- 
2(705SnS708) RO-700)- 


M (n * 1000). RO-IOOOB (2) - 


[j8 tables; 82 words (table entries)]] 


Fig. 6. 
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ORGANIZATION OF COMPUTER SYSTEMS — THE FIXED 
PLUS VARIABLE STRUCTURE COMPUTER* 

by Gerald Estrin 

Department of Engineering and Department of Mathematics 
University of California, Los Angeles 
Los Angeles, California 


I. Introduction 

The past decade has seen the development of 
productive fast electronic digital computers. Sig- 
nificant problems have been solved and significant 
numerical experiments have been executed. More- 
over, as expected, a growing number of important 
problems have been recorded which are not practi- 
cably computable by existing systems. These lat- 
ter problems have provided the incentive for the 
present development of several large scale digital 
computers with the goal of one or two orders of 
magnitude increase in overall computational speed. 

The following discussion attempts to charac- 
terize aspects of digital technology related to 
extending the bounds of practicable computability 
and a new concept of computer system organization 
is proposed. 

Component Technology 

During the past semi-decade advances in reli- 
ability and speed have been produced by the devel- 
opment of magnetic core memories, new semi-conduc- 
tor switching elements, Improved vacuum tubes and 
superior input-output media. Further speed in- 
creases are promised by magnetic thin films, low 
temperature devices, semiconductors, microwave and 
microminiaturization techniques. The fastest sin- 
gle component switching speeds being discussed are 
about 10“° second. For any significant parallel 
information transfers most researchers observe a 
loss of a factor of lCr or ICk bringing basic par- 
allel computer operations to 10“° or 10”' second. 
These speed losses are a function of driving cir- 
cuit limitations, length of lines, requirements 
for coincidence of independent applied forces, pro- 
pagation time of output signals, amplification de- 
lay, destructive observation of memory cells, and 
requirements for simultaneous observations of the 
results of information transfers. 

One cannot expect to do very much better with 
the populations of switching elements without a 
change in the method of organizing them. We may 
expect significant effects of the great effort be- 
ing put into ’•microminiaturization" techniques. 

Any method which can accomplish more complex digi- 
tal operations in a smaller volume will signifi- 
cantly affect overall speed if the basic element 
switching speed and the digital nature of the op- 
eration are not lost in the process. 


■+ The preparation of this paper was sponsored, in 
part by the Office of Naval Research, and the 
Atomic Energy Commission. Reproduction in whole 
or in part is permitted for any purpose of the 
United States Government. 


Embryonic Systems 

The most powerful digital computers either 
new-born or in development or under study are uni- 
versal stored program machines which provide par- 
allelism of operations 1-1 -'. This parallelism is 
introduced not just at the level of the bits in a 
number but rather by preparing new operands and 
instructions while arithmetic and logical opera- 
tions are executed j by processing input and output 
data simultaneously with operation of the high 
speed machine j by providing more than one arith- 
metic unitj and by controlling the parallel opera- 
tion of more than one machine. 

Practicable Comput ability 

In a finite system practicable computability 
is a function of such mundane factors as cost of 
computing and total delay from formulation of a 
problem to interpretation of results. These fac- 
tors must of course be added to round-off, trunca- 
tion, problem reformulation and machine reliability 
all of which determine whether there is a relation 
between the symbols coming out of a computer and 
the answer to the original question posed. 

The most significant advances are to be ex- 
pected from the formulation of better methods of 
solving problems, of organizing physical elements 
for problem solution and the interaction between 
these two approaches. Any factors which enhance 
mathematical experimentation and man-machine learn- 
ing are important system characteristics. With 
existing computing systems man-machine learning 
has been exercised in the past by the growth in 
libraries of subroutines and automatic programming 
methods, by growth in skill of the expert program- 
mer, and by accumulated experimental results con- 
cerning the success and failure of various numeri- 
cal methods and deficiencies in the machine design. 
There has been a particularly dramatic change in 
the language required to communicate with machines. 

Structural Rigidity 

In the process of design of any general pur- 
pose computer, a command set is selected on the 
basis of past experience and analysis of some lim- 
ited problem set. With a universal computer it is 
known that it is always possible to program any 
special command. The compromise between s impli- 
cit y, cost and speed resulted in short command 
lists in the first generation of machines. As 
speed and ease of programming became more signifi- 
cant, command lists lengthened. The "microprogram- 
ming nl ^~l 7 approach made it feasible to change the 
command list by resequencing of a defined set of 
microoperations . 
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II. Design Objectives 

The primary goal of the Fixed Plus Variable 
Structure Computer is: 

(1) To permit computations which are beyond the 
capabilities of present systems by providing 
an inventory of high speed substructures and 
rules for interconnecting them such that the 
entire system may be temporarily distorted 
into a problem oriented special purpose com- 
puter. 

In the following the above design objective 
will be expanded in an attempt to remove obvious 
ambiguities . 

(la) M ... computations which are beyond the capa- 
bilities of present systems..." 

The problems considered here are those for 
which some model exists — that is, a set of 
rules according to which independent variables 
interact to produce change in dependent vari- 
ables. Given such a model and given a ques- 
tion concerning the behavior of the dependent 
variables, then if existing systems require 
too much time to provide an answer to the 
question in a usable form, the problem is con- 
sidered beyond their capabilities. 

The least ambiguous cases are those for 
which there is an explicit time limitation — 
commonly called real time problems. In such 
cases computations are often required to pre- 
dict changes in a process and prescribe cor- 
recting forces according to chosen criterion 
functions. The absolute time available for 
computation is determined by the process to 
be controlled and may vary from the order of 
10“' second to the order of a year. In the 
former case the operation of a single binary 
element is just beyond the horizon. In the 
latter case it is possible to carry out about 
lO 1 ^ sequential operations during that time 
interval. It is of course possible to remove 
the real time pressure and test the predictive 
power of a model for one stage of a process. 
However if the effect of the corrective force 
is to be determined through several stages it 
is necessary to either be constrained by the 
time restrictions of the process or be cap- 
able of simulating the process on an expanded 
time scale. 

Most problems have more implicit constraints 
on computability. Thus if a particular budget 
is allowed for the solution of a given prob- 
lem, the cost per hour and computation time 
will determine computability. 

In some cases , the value of a solution may 
be very high because of widespread influence 
on future technology but it may take so long 
to conduct numerical experiments that the in- 
teraction between investigators and experiment 
is essentially vitiated. In these cases user 
personality may determine computability. 

Some of the problems which exist as a 


constant challenge to computer designers and 
mathematicians are the numerical solution of 
partial differential equations in four inde- 
pendent variables, nonlinear partial differ- 
ential equations of lower dimensionality, me- 
chanical vibrations in solids, truly three 
dimensional problems in elasticity and plas- 
ticity, multiple integration problems arising 
in connection with the equation of state, ra- 
diation diffusion, complex design problems 
such as reactor design, combinatorial probLems 
and problems associated with experiments in 
number theory. 

One may also consider the aggregate of 
smaller problems as a distinct problem type. 
Most computer manufacturers are in fact con- 
centrating their attention on computer organ- 
izations which can process several independent 
computations in parallel so as to maintain a 
high duty factor in all elements of the sys- 
tem and reduce the cost per operation. Al- 
though the variable structure computer will 
consider such a property as a design factor, 
major emphasis will be placed on overall com- 
putation time of individual problems which 
cannot be handled by existing systems. 

(lb) "...an inventory of high speed substruc- 
tures ..." 

The Fixed Plus Variable Structure Computer 
will consist of a set of independent digital 
complexes ranging in size from individual 
switching elements or flip-flops to, say, 
shift-registers and counters to a high speed 
general purpose computer. The latter element 
is "fixed" in the sense that the machine user 
is presented with some minimum vocabulary and 
some minimum set of machine characteristics 
which do not change rapidly with time and 
which do not require the specification of some 
variable set of interconnections for each 
problem. 

The existence of the "fixed" general pur- 
pose computer as an element in the system is 
considered essential to the evolution of high- 
er languages for man-machine communication. 

If either the instruction code were always 
changing or the meaning of the instructions 
were always changing, any tendency for humans 
to invest effort in the definition of more 
complex instructions would be quickly damped 
despite the ability to make use of the vari- 
able structure inventory to effect higher larg- 
uage operations. The items in the inventory 
list are described by their inputs, outputs, 
logical functions, permissible loading, speed 
of operation, noise sensitivity, etc. The 
manner in which the initial inventory of sub- 
structures is determined is discussed in the 
second part of this paper. 

(lc) "...rules for interconnecting them such that 
the entire system may be distorted into a 
problem oriented special purpose computer..." 

The items listed in the inventory are not 
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in spatial positions relative to each other. 
Every effort is to be made to minimize para- 
sitic resistance, capacitance and inductance 
in the interconnections such that the cir- 
cuits may operate at speeds near those of 
equivalent fixed circuits. Wherever possible 
programmable electronic switching from one 
configuration to another will be considered 
but manual switching (e.g. soldering iron) 
will be considered if it leads to improved 
performance. It should be observed that sub- 
structures may of course operate in parallel. 

(2) To permit computations which are beyond the 
capabilities of the initial system by estab- 
lishing characteristics such that the inven- 
tory of substructures and the rules for in- 
terconnection may be changed as a function of 
past experience and new technological advances; 

For example if it is found that an assem- 
bly of a number of the inventory items is used 
very frequently, then it may be desirable to 
introduce that assembly as a separate inven- 
tory item with speed optimized fixed wiring. 
In fact over longer periods, it may be obser- 
ved that some substructures are used so fre- 
quently that they should be part of the "fix- 
ed” structure general purpose computer 30 that 
there is no need to explicitly specify the 
substructure each time it is used. 


serves to highlight the type of evaluation which 
may precede the design of any particular special 
purpose substructure. Consider the polynomial, 

y = C 0 + Cix + ... + CjjX 11 (1) 

The most common method called nesting or Horner’s 
method effects the calculation as 

y = ^... ((On* + C n _ x ) x + C n-2 )x + ... + C^j (2) 

If one assumes that "multiply-accumulate" is in- 
cluded in the set of elementary machine operations 
then the highly sequential graph of Fig. 2a demon- 
strates the computational flow. If there are no 
constraints on equipment, a tree structure of par- 
allel arithmetic operations may be used to minimize 
the number of sequential steps as illustrated in 
Fig. 2b. An intermediate sequence suggested by K. 
Ralston makes efficient use of two multipliers 
falling just shy of doubling the speed of function 
evaluation. The process which consists of essen- 
tially separating odd and even terms, factoring, 
and using Horners method on the half length poly- 
nomials is illustrated in Fig. 2c for 

y = C 0 + Ojx + C 2 x 2 + ... + CnX n (3) 

2 2m 

y = C 0 + C 2 x + . . . + C 2m x 

, 9 2m v 

+ x( C]_ + C 3 X + . . • + C2m+l x ) 


Figure 1 illustrates the relation between the 
Fixed Machine, the Variable Structure part, the 
Input-Output, Supervisory Control and the Humans. 

III. Scope of Study for I*F + VS 1 

L J o 

Investigators at UCLA are now in the process 
of studying problems and compiling information 
which will permit a first design of the variable 
structure computer system. 

Major emphasis is being placed on the produc- 
tion of a library of substructures. There will be 
two primary sources of that library. The first is 
the design of configurations for frequently used 
subroutines utilized by existing computer centers. 
The second is the design of configurations for par- 
ticular complex problems of the type listed in the 
first part of this paper. In the latter problems 
many subroutines (substructures) must be linked in 
in the integrated process of problem solution. 

In each case the computational method is an- 
alyzed; other computational methods are studied for 
possible increased overall speed; the computational 
methods are described in an essentially sequential 
form, i.e. the minimum number of element ary opera- 
tions which must occur in sequence even if avail- 
able equipment is unlimited (excluding total table 
lookup); and a recommended special purpose compu- 
ter organization is defined assuming some reason- 
able equipment constraint. 

The simple process of polynomial evaluation is 
one which is fairly efficiently handled by the con- 
ventional computer. However its discussion below 


y = ^... ^ (C 2 njX + C2m-2) x + fymrh} 

+ x ^... ((C 2m+1 x + C 2m -i) x 

x + ... + (U) 

The simple nesting procedure represented by 
Fig. 2a, requires a single multiplier. At each in- 
terior time step the result of a previous multiply- 
accumulate operation becomes a multiplicand with 
the multiplier, x, unchanged and a new constant ad- 
ditively introduced. This is a very efficient op- 
eration requiring four initial memory accesses for 
the instruction and the first three operands fol- 
lowed by n multiplication times and n-1 access 
times for the additive constants. In a machine 
utilizing an anticipatory control the latter con- 
stants might be obtained while multiplication was 
proceeding. If such a machine provided a multip- 
lication time which was a function of the operands 
one would obtain the time for polynomial evaluation, 
n 

^pe^ ^access + 21 t x 

i=l 1 

where t x . = the time required for the i^ h 
1 multiplication 

For n = 2m or 2m-l, the process represented by Fig. 
2b uses m+l arithmetic units during the first step 
with the required number of units reduced accord- 
ing to the same relationship at each of the I steps 
where 


x + . . . + C 0 

/ 

+ c 2m-3 > ) 
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I < log 2 n < I + 1 

There is always a final step utilizing a single 
arithmetic unit and resulting in a total of I + 1 
sequential multiply-accumulate operations. If 
only a single arithmetic unit were available the 
operations implied by Fig. 2b could be sequentially 
effected in (n + I) multiply accumulate operations. 
Two arithmetic units would reduce the number of 
steps to approximately (n/2 +1). It is also of 
some significance to observe that the increased 
parallelism reduces the effect of information in 
the operands. The time required up to any node in 
the tree is determined by the longest computation 
time in the branches leading into that node. 


However, an algorithm was devised by C. B. 
Tompkins which made use of a multi- radix represen- 
tation of a number representing each permutation. 
Some twenty students were asked to program the 
generation of these permutations on eleven differ^ 
ent modern computers far n = 10, i.e. nl xlO . 
The time estimates ranged from 7 to 3U>000 hours. 
An extremely simple special purpose accumulator ef- 
fecting variable radix addition reduced the estim- 
ated time to about 8 seconds. 

As a result of design efforts like these a 
library of substructures will be produced and then 
overlayed to aid the first specification of the 
variable structure inventory. 


The process represented by Fig. 2c uses two 
arithmetic units and if we assume two independent 
memories the process requires a time, 



+ t_ + t„ 

X in Xf 

where t x ^ n is the time required for the ini- 
tial multiplication forming x^ and is the time 
required for the final multiplication. Thus this 
process is more efficient than that of Fig. 2b for 
most pases in which there is a constraint on the 
number of available arithmetic units. With two 
arithmetic units only m-f2 steps are required. What 
is the criterion by which one may select among 
these various methods? The focus of this computer 
system is on increasing the computability of each 
problem formulated for it. The criterion for de- 
ciding between alternate sets of substructures is 
the estimated time for a given amount of computa- 
tion or equivalently the amount of computation 
achieved in a given time. The alternate substruc- 
tures themselves are constrained by the total avail- 
able inventory and the inventiveness of the de- 
signer. Referring back to the simple polynomial 
evaluation, if a problem requires the independent 
evaluation of many polynomials, it might be more 
efficient to use independent substructures each ac- 
ecuting the simple sequential version of Horner's 
method. On the other hand if the quantities enter- 
ing into one polynomial are dependent upon the cal- 
culation of the previous one, the more parallel 
substructures will probably be more efficient. 
Finally it would be desirable to compare all of the 
above with substructures whose design is focussed 
on reducing the time of any single multiply-accunu- 
late operation. 


In fact, applications for which the variable 
structure computer exhibits its greatest strength 
are those which may utilize abnormal number repre- 
sentations such as multiple precision, partial pre- 
cision or multi-radix representations. 

For example, consider the problem of generat- 
ing the permutations of n unlike objects, where the 
objective is actually evaluating some function of 
the permutation. There are nl such permutations 
and so purely the process of systematically gener- 
ating them is limited to fairly small values of n. 


A number of new high speed computers are be- 
ing evaluated as possible candidates for the fixed 
structure computer. In addition to summarizing 
their primary characteristics, the instruction 
lists of the several machines are being described 
in a common notation in order to make comparison 
easier and to intelligently process the following 
typical questions. 

"Which instructions must be included 
in "F" to enhance the overall system 
without deteriorating the performance 
of "F" alone." 

"How simple is it possible to make "F" 
because "V" is included in the system?" 

"Does "F" permit extension of the nor- 
mal word length by "V"?" 

"How easy is it for "V" to operate in 
parallel on partial word lengths, i.e. 
how fast, can packed words be trans- 
ferred from "F" to "V"?" 


"How much memory should be shared by 
"F» and "V"?« 

These question!? are primarily related to "F" — 
another set is typical of the questions pointed to- 
ward the variable structure computer. 

Assuming that the first estimate of the in- 
ventory of modules in "V" is determined from re- 
sults of the above studies, from the overall ex- 
perience of the designers and with a necessary 
dollar constraint, 

it 

What is the character of modules which 
will permit physical re-organization in 
a short time without need for determin- 
ing resonances and removing common line 
interactions?” 

"Given a user with a problem what is the 
procedure which will lead to the design 
of a special purpose computer, to the 
specification of modules, their relative 
position and interconnection?" 

"What procedure can be established to 
check such organization rapidly?" 

"How can each new structure be described 



such that learning operates in the man- 
machine system, i.e. the next time sim- 
ilar substructures are needed it requires 
much less time from problem statement to 
use?” 

"How can we design a "cleared" state of 
the variable structure computer such 
that the variable inventory is inter- 
connected like a second "F"— since there 
are many problems for which the best we 
can do is to provide two computers op- 
erating in parallel on different sets 
of data?" 

"How can the supervisory control exer- 
cise its power without affecting the 
speed of operation of "F" and "V"?" 

"How can the computer system be used to 
aid the design of new organizations?" 

This list of questions can grow very long but 
it serves to display the directions of our thinking 

One might expect to find questions of inter- 
locks and queueing appearing strongly in a parallel, 
system like this. However it should be repeated 
that the focus of this system will be extending 
computability of individual large problems by de- 
tailed reorganization of the computer inventory 
into parallel special purpose substructures. Hence 
the parallel processing is a much more determinis- 
tic arrangement than those systems which seek to 
optimize the handling of many independent problems. 

IV. Conclusion 

0 

The UCLA development described above is at an 
early stage. However we hope to present a first 
design in one year so that we may make a proper 
target for criticism. We are firmly convinced 
that when a special purpose configuration may be 
accomplished using available facilities, a new le- 
vel of inventiveness will be exercisable. 
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HORIZONS IN COMPUTER SYSTEM DESIGN 
By 

Walter F. Bauer 
Intellectronic s Laboratories 
Ramo- Wooldridge 
Canoga Park, California 


Summary 

"Computer system design", or using 
the new phrase "computer organization design", 
has not received nearly the attention that the 
design of circuitry and components has 
received. Circuitry and component develop- 
ment tend to provide the user with higher 
operating speeds at lower cost but do not 
necessarily provide the user with a system 
easily adaptable to his particular needs. There 
is a great need for computers which can be 
adapted, as a total system, to the particular 
needs of the user. The changing needs of the 
user result from considering the computer as 
one part of a larger information handling 
system where man-machine communication is 
of great importance. There are a number of 
trends in the organization of computers such 
as concurrent operation, modularity, control 
hierarchy, and integral interrogation and 
display, which help meet the needs of these 
more sophisticated applications.. Analysis 
shows some of the benefits of modular design. 

In addition to the design of the computers 
themselves, the analysis techniques for inves- 
tigating the application, are appearing. These 
involve queuing theory and the development of 
methodologies for information systems analysis 
and the evaluation of computer applications. 

Intr oduction 

It is interesting to note the phrase 
"computer organization" in the title of this 
session. For some time now, those of us 
interested in the broader systems aspects and 
user aspects of computer design, have been 
using the phrase "computer system design". 

T o many, however, the phrase "computer 
system design" still has the connotation of 
circuitry and component design or even that 
branch referred to as "logical design". 

Although Webster defines "organization" and 
"system" with phrases that have identical 
meanings, to many the phrase "computer 
organization" will have a different meaning 


than "computer system". The former clearly 
refers to broader systems aspects of 
computers such as instruction repertoire, 
communication with magnetic tapes, systems 
expandability, etc. Perhaps "computer 
organization" will stick; at any rate, the word 
"system" is slightly less overworked. 

The thesis of my talk is that insuf- 
ficient time and energy has been devoted to 
the analysis of computer organization; the 
developments in organization have been over- 
shadowed by developments in circuitry and 
components. There have been some recent 
developments, however, in computer organiza- 
tion which indicate growing emphasis in this 
direction. It is my plan to review some of 
the changing needs, and the trends in computer 
organization to meet these needs, and to 
further discuss certain techniques in the 
analysis of information handling systems. 

The Status of Computer Organization 

In the general design of computer 
systems, three areas of activity come to 
mind: circuits and components, programming, 
and systems design or computer organization. 
There have been noteworthy developments in 
circuitry and components, many of which 
could be classified as breakthroughs: the 
development of large-scale magnetic core 
memories, the development and use of the 
transistor, the use of printed circuitry, 
and computer automated design techniques. 

We are in the midst of still higher speed 
circuitry developments with microwave 
techniques and nanosecond speeds becoming 
discussed more frequently. Similarily, there 
have been somewhat less noteworthy, but 
significant, developments in programming: 
sophisticated compilers, formula translators, 
complete programming systems for automatic 
computer operation; more recently we have 
seen the development of problem oriented 
languages such as COBOL (Common Business 
Oriented Language) and programs for 
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translating between various computer languages. 

But the area which has, from my point 
of view, received far too little attention, and 
the area which we are discussing here this 
morning, is computer systems design or 
computer organization. Until approximately 
1958 there was little progress in system 
design. There had been a gradual improvement 
in instruction logic with index registers becom- 
ing common and certain concepts like the 
indirect address, coming into use. The idea 
of buffering data between the input - output was 
advanced and the interrupt was adopted as a 
powerful technique for matching the computer 
approach to asynchronous external devices. 
While these advances are significant, they do 
not in any sense compare with the giant steps 
taken in circuitry and component development. 

Happily, during the period since 1958, 
a number of significant advances have been 
made. The idea which now appears as obvious, 
of having a computer operate in a truly parallel 
fashion was first embodied in the GAMMA 60 
and the LARC computers. The designers of 
these machines saw the possibilities of increas- 
ing speed through having large parts of the 
computer operating simultaneously rather than, 
or perhaps in addition to, increasing circuitry 
speeds. While it cannot be proven that 
parallelism or concurrent operation is more 
economical than higher speed circuitry, I 
believe that few would argue that increased 
parallelism can bear fruit and that it has not 
been pushed far enough or fast enough. 

We are on the verge of other break- 
throughs in computer organization. The concept 
of modularity coupled with sophisticated intra- 
machine communication, such as that found in 
the Ramo- Wooldridge RW-400 computer, will 
undoubtedly receive a great play in the future. 

Application Horizons 

For the purposes of our discussion here, 
consider the applications field as being divided 
into three parts: computing systems, control 
systems, and information systems. "Comput- 
ing systems" here refers to the mechanization 
by electronic computers of processes which 
are being done or can be done semi -automati- 
cally or manually. For example, partial 
differential equations were solved for many 
years by manual means. Similarly, business 


data processing was performed by a variety 
of machines before the main bulk of the 
activity was taken over by the electronic 
digital computer. Computing systems 
constituted the first application phase of 
electronic computers; this application will 
continue to grow and flourish and is, and 
will remain for many years, the major 
application area of digital computers in terms 
of dollars spent. 

The "control systems" applications 
area of electronic computers is a much newer 
one. Perhaps the first notable control system 
was the Air Force's SAGE System for Air 
Defense. Other systems such as refinery 
process control are appearing. This 
applications area is possibly best character - 
ized by saying that it provides closed loop 
control. It frequently requires specialized 
input - output for the computer, and seldom 
requires human intervention -- the human only 
serves to monitor the process. Indeed, an 
important characteristic of control systems 
application of computers is that the human can 
not carry out the process through manual or 
semi-automatic means because of accuracy 
requirements, speed requirements, and the 
requirement for reliability in spite of 
monotony. 

But the newest and most exciting 
applications area for electronic computers 
is the so-called "information systems". 

The lines of definition between the three 
applications areas are not clear cut and it is 
difficult in some cases (and unimportant) to 
decide in which area a particular application 
belongs. However, information systems are 
probably best characterized as follows: 

1. The computer is imbedded in a 
system involving data receipt 

from many sources, information processing, 
information display, and information 
dissemination. The computer itself is a 
relatively small part of the operation. 

2. Automatic data inputs from remote 
sources are part of the system. 

Data may be control data or data to be 
processed. The familiar digital data link 
involving wire communications or radio 
communications is employed and speeds up 
to 5500 bits per second are achieved. 
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3. Man-machine relationships are of 
the greatest importance. Equip- 
ments for the display of information are an 
integral part of the system as well as equip- 
ments to interrogate the memory of the 
computer. Virtually all the memory of the 
computer is available to analysts or operators, 
and many analysts can consult the memory or 
use the computer facility simultaneously . 

4. Processing is characterized by 
the processing of independent 

requests for service. The ratio of tolerable 
delay to processing time is low as compared 
to most applications of computers for scientific 
and business purposes. 

5. The requirements for reliability 
are very severe. Electronic 

failures which cause the complete outage of 
the system, are intolerable. 

6. The information system must have 
the provision for growing as an 

integral system. This growth must take place 
in an orderly way and without lengthy periods 
of inoperability during change over. 

7. The system must respond to a 
large variety of requests and it 

must adapt itself to the particular processing 
requirements of the instant. 

Any information system may have these 
characteristics to varying degrees. In some 
cases, some of the characteristics are absent 
completely. For example, the requirement 
for absolute round-the-clock reliability may 
be absent. Another comment is in order: 
frequently these so-called information systems 
perform closed loop control of devices in 
addition to providing information to which humans 
react. In other words, the definitions for the 
various applications areas here are suggested 
to provide general guide lines and not absolute 
lines of demarcation. 

Needless to say, information systems 
is the newest and most sophisticated applica- 
tions area for electronic computers. Consider 
for example the use of interrogation devices 
and display devices in connection with computers. 
They are almost non-existent today. However, 
they are beginning to appear as standard pieces 
of computers and the possible uses of these 
devices are appearing in numerous places, 


frequently in unexpected areas. 

Quite possibly the market for 
information systems will dwarf all other 
market areas eventually. The road ahead, 
however, is not without its serious obstacles. 
Programming techniques must be developed 
much further to allow the efficient communi- 
cation between man and machine that is made 
possible, at least, by hardware inquiry 
stations and display devices. The automatic 
report generator techniques in computer 
programming are a beginning step in this 
direction. Also considerably more work must 
be done in the overall systems analysis to 
make good use of computers. It has been 
frequently stated that computers attached 
with automatic data input equipment, display 
and interrogation equipment can be used by 
corporate top management to conduct their 
daily busines s and more specifically, make 
computer-aided decisions. Operations 
analysis and decision theory must be developed 
considerably further before such uses of the 
equipment can be made. 

Computer Organization Trends 
Computer Adaptability 

Information systems, however, give 
rise to a different set of requirements for the 
customer than has been previously seen. 

Almost all of the needs can be summed up in 
one short comment: There is a need for 
computers which can adapt to problems. Up 
to this time, computer customers have found 
it necessary to adapt their problems to the 
computers; the computer is purchased as an 
entire off-the-shelf system and the customer 
does as best he can to make his problem 
conform; gradually the problem fills the 
computer and the customer starts all over 
again worrying about the next phase. 

In the never ending search for speed, 
many of the requirements of the customer -- 
many of which now seem obvious and funda- 
mental -- which would make the computer a 
saleable item, were neglected. Everybody 
wants to sell computers but it seems that 
nobody wants to design computers attractive 
to buyers . 

As stated above, the computer must be 
adaptable; it must be adaptable to the particular 
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characteristics of the application, it must be 
adaptable to the requirements which may change 
from minute to minute or from millisecond 
to millisecond, and it must be adaptable to 
changes which occur throughout the years. 

Adaptability can undoubtedly be best 
achieved through modularity; the two thoughts 
are almost synonomous. Having modular 
components which can be added {or deleted) 
as the application changes provides the long 
term adaptability. Adaptability on a micro- 
second-millisecond basis is provided by 
connecting the modules by high speed switching 
e quipme nt . 

Instruction Repertoires 

Adaptability to the requirements of the 
user can appear in many places. One of these 
places is in the instruction repertoire. Some 
advances in computer instruction repertoire 
have been made during the last few years as I 
noted previously. However, techniques do not 
exist today to determine in some quantitative 
way, the power of a set of instructions as 
applied to a given information processing 
problem. Perhaps in this connection one should 
borrow an idea from the mathematical physicist. 
Consider treating a given application as an 
abstract vector space which is "spanned", as 
the mathematical physicists say, by the set of 
computer instructions. The set of instructions 
can be considered to be "independent" according 
to some definition of "independence" and as is 
done in theoretical physics in the case of a set 
of vectors. The set of independent vectors is 
"rotated" to minimize the "energy" required to 
"span the (problem) space". In other words 
the set of instructions should be chosen to be 
minimal from the standpoint of the processing 
required and with respect to a particular 
problem application. 

A possibility in instruction logic is the 
idea of building computers which can change 
their form through changing the set of instruc- 
tions they provide. This technique is frequently 
referred to as "micro-programming" and has 
fallen into a state of considerable disrepute. 
However, there are many good reasons to 
exhume the remains. The idea will look 
promising with the following provisos: if 
the programming is not burdened unduly by a 
new set of details, and if the computer can 
operate at nearly the same speeds as its wired- 


in counterpart, and if criteria can be 
e stablished for the intelligent choice of 
computer instructions such as is suggested 
in the above paragraph. A possibly attractive 
idea is to build a basic computer package for 
marketing, and design it so that the buyer or 
seller can "particularize" it for a given 
application. 

In a session* similar to this one at the 
E JCC in late 1958, the fact was deplored that 
instruction repertoires were not oriented 
toward automatic programming and translation 
between computer languages. Unfortunately 
nothing has happened to make that indictment 
less justified. 

Information Display and Interrogation 

Man-machine communication is under- 
going important changes. These changes will 
probably result in most large scale computers 
being designed from the outset to include 
display and interrogation consoles. Our first 
computers, and most of our present ones for 
that matter, allow the display of perhaps 100 
lights to signify the state of the computer or 
to provide information to the user -operator . 

In most cases, approximately the same number 
of on/off switches are available to communicate 
with the computer. Of course, these computers 
were not specifically designed with sophisti- 
cated man-machine communication in mind. 

Our modern computers designed for a tight 
man-machine communication loop involve 
the presentation of thousands of bits to the 
user-operator in assimilable form while 
allowing bim communication switches in 
roughly equal numbers. With most systems 
currently in use, information in the form of 
printed copy comes from the machine; the 
user must usually wait hours -- and frequently 
days -- before the new set of information, 
which he requests as a result of what he has 
just previously learned, can be made available. 
Not so, of course, with the new display- 
interrogation operator consoles such as those 
found on the RW -400 computer. 

Computer Control Hierarchy 

Figure 1 depicts a technique increasing 
in popularity in computer design and computer 
use -- that of establishing a hierarchy of 
computer control. A possible result of this 
"status seeking" of computer operations is 
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shown in the figure. The Systems Management 
function is the highest level of control; it would 
be responsible for the systems and procedures 
of the operation and the problem priority 
requirements. If, for example, a certain class 
of problems were not getting the required 
service, this level of control would change the 
priority of that problem class. The Systems 
Management control level would perform 
utilization analyses which would provide 
information as to possible equipment changes, 
and it would provide summaries and records of 
the way in which the computer system was used. 
Under this hypothetical model. System Control 
would be the next control level and would be 
responsible for the assignment of equipments to 
do a specific problem and for executing the 
priorities which were imposed at the higher 
control level. It would react to stimuli from 
external sources such as digital data links and 
the depression of keys at an analyst's console, 
it would initiate much of the processing 
required when problems are interrupted, and 
it would terminate problems as required. At 
the Problem Control level, macro -instructions 
would be interpreted and the management of 
memory would be performed; that is, decisions 
would be made here as to where data is to be 
stored, and how and where it is to be transferred. 
At the Problem Control level, the reactions to 
internal stimuli would be handled such as the 
interruption of the problem due to lack of data 
or the onset of an overflow condition. In 
addition, at this level the management, 
programming logistics and subroutines would 
be performed. The last two levels of control. 
Programming Logistics and Data Processing 
and Equipment Control, are the usual kinds 
of computer control which we have been doing 
many years. It might be added that a still 
lower level of control could be identified, 
namely, "micro programmed control". 

Whether the control as indicated here 
would be carried out by hardware or through 
programming is uncertain. Probably most of 
the control would be effected through program- 
ming although this control is made possible 
through hardware . For example, the interrupt 
signal enables the reaction of the computer to 
external asynchronous occurrences. It 
behooves the computer designer to seek ways to 
make this control hierarchy easy to achieve 
through programming. 

The functions of this hypothetical 


control model could be carried out in a time- 
shared way by one serially operating computer 
or, on the other hand, by computing and 
control elements working simultaneously and 
to the extent indicated, independently. 

Probably the latter technique will be found to 
be more effective; there will be a trend toward 
physically separate units performing the 
indicated control functions. 

The importance of a hierarchy in 
computer control is that it gives the computer 
a self -organizing character. With such a 
hierarchy the computer can be introspective 
much as the human is, and can continually 
monitor the course of its business to 
optimize its operations. 

Modularity and Reliability 

Some of the benefits of a modular- 
designed computer have been referred to 
above in discussing computer adaptability. 
Another advantage of modularity is that it 
brings about greater reliability at a lower 
cost. 

In Figure 2 there is shown a diagram 
showing the relationship between modularity 
and reliability. On the abscissa is the degree 
of modularity of a computer system defined 
as 100 minus the percentage of the total 
represented by the largest module. If, for 
example, the computer was made up of 10 
modules, each of which was 10% of the total, 
the degree of modularity in this case would 
be 90%. The ordinate here refers to the 
extra equipment which is needed to provide 
a level of reliability equal to that of duplicat- 
ing an entire computer system. The solid 
curve shows the amount of extra equipment 
needed to provide the same degree of 
reliability as obtained by doubling up on the 
amount of equipment in the case of the 
computer with zero modularity. It is assumed 
that all modules have the same reliability. 

There are many vagaries in interpret- 
ing a graph of the kind shown in Figure 2, 
and the results there are meant to be more 
qualitative than quantitative. However, it can 
be shown through simple mathematics that 
the computer 90 per cent modular requires 
only 30% extra equipment to get the same 
reliability as duplicating the entire single 
computer with no modularity, where the 
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assumption is made that the modules are each 
no more reliable than the entire computer 
without modularity. The dashed lines showing 
the total equipment refer to a rough estimate 
of the equipment needed plus a rough estimate 
of the additional equipment needed for 
communication among the modules. It seems 
clear that as the modularity proceeds closer 
to 100% -- that is, closer to the transistor or 
diode level -- that the equipment needed for 
communication becomes very great, and 
possibly unbounded in the limit. Experience 
plus some analysis shows that there is probably 
a minimal point lying somewhere between 80 
and 95% degree of modularity. 

Computer reliability, modularity, and 
control hierarchy, incidentally, team together 
in an important way. One of the higher levels 
of control, probably System Control of 
Figure 1, monitors equipment failures . When 
failure occurs the module not in use is switched 
in. Or, if all modules are in use, the lower 
priority problems are temporarily put aside. 

In other words, catastrophic failure does not 
occur; the system automatically adapts itself 
to carry on the processing with only slightly 
degraded performance. 

Computer Memories 

Progress in the system organization and 
use of high speed memories is particularly 
lacking. We have made memories bigger and 
faster, but that is about all. Some discussion 
has taken place of the virtues of small ultra - 
high speed memories for "scratch-pad" use, 
and it has been adopted in a few cases. A few 
possibilities for new uses have been advanced^ 
but there has been no embodiment of anything 
organizationally or logically new and, more 
surprisingly, no analysis has been performed. 

Referring to Figure 3, there is a remark 
that could be made about the duty cycle of large 
scale memories. Probably the curve of memory 
usage vs memory size for large scale memories 
is similar to that shown in the figure. It 
certainly is true that the absolute requirement 
for 30, 000 words of storage occurs far less 
frequently than the requirement for 4, 000 words 
of storage. It is realized of course, that this 
is a function of the problem and a function of 
the programmer's tolerance of red tape 
programming logistics. However, since 
32, 000 words of memory are on hand in the 


hypothetical example, there is an extra 
capacity as represented by the part of the 
rectangle above the curve. This extra 
capacity is roughly proportional to the excess 
capacity for the memory device in excess of 
the "theoretical" amount needed as 
delineated by the curve. Therefore, it seems 
that if the computer were designed to share 
this memory with other problems and if a 
system could be devised to switch memory 
assignments, the duty cycle of the memory 
could be greatly increased and efficiency 
would accrue. The curve and the large area 
above the curve suggest that a considerable 
fraction of the cost of the memory could be 
spent in the hardware to effect a sharing of 
the memory and still a saving could be 
achieved. 

Information Systems Analysis 

There was reference above to the need 
of computers which adapt to problems. The 
question immediately arises, however, as to 
deciding what the needs of the problems are. 

It is clear that if information systems are to 
take their respected place in our scientific 
world, we must develop ways of analyzing 
them. 

Queuing Theory 

One of the disciplines which might be 
applied is queuing theory. It was stated above 
that information systems are characterized 
by a low ratio of tolerable delay to process- 
ing time for a service request. It is this 
characteristic which makes a queuing theory 
approach possible. A non -mathematical 
statement of the problem would go something 
like this: service requests for the processing 
of data arrive on a random basis and it is 
desired to know the minimum equipment 
necessary to provide the servicing of these 
requests while meeting system performance 
specifications stated in terms of maximum 
tolerable delay or average delay. 

Figure 4 shows a curve which relates 
the processing speed of a computer system to 
the average delay in processing a request. 

The curve is similar to that given by Ackley^ 
but the variables have been normalized so as 
to make the results generally applicable. An 
important implication of this curve is the 
following: the average service demand rate 
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in almost all applications goes up as the system 
develops. If the processing speed does not at 
the same time go up, the ratio given by the 
abscissa is reduced and the average delay 
increases as the curve shows. Obviously then, 
there is a requirement to have the computer 
designed so that its processing speed can change 
according to the demand for service so as to 
keep the average delay within tolerable bounds 
and still not have a period of time when the 
system has excess capability and excess cost. 
Modular computers provide exactly this ability 
to increase their processing speed. Computing 
elements can be added to the system to keep 
pace with the demand and to keep, consequently, 
the average delay bounded. 

Information Systems Design Methodologies 

In the design of information systems it 
is important to develop techniques for 
systematically fitting the computer require- 
ments to the operation requirements of the 
system. 

Figure 5 shows a method for carrying 
out such a design. The operational require- 
ments give rise to the identification of certain 
data types, volumes of data, the processing 
required, and the distribution of requests for 
service. Together they define a problem. If 
an initial assumption is made as to the speed 
for processing each one of these types of data, 
the entire process can be analyzed. Probably 
the technique used here is simulation by large 
scale digital computer. This gives rise to 
information on the average length of delays 
which, when compared with the delay require- 
ments imposed by the operational character- 
istics of the system, will determine whether the 
service is tolerable or not. If the service is 
intolerable, then assumptions must be changed 
as to the speed with which the processing is 
performed. Having determined that the 
service is tolerable, and having determined 
the speed requirements, an analysis can be 
made to determine the complement of 
equipment necessary. After this is done, the 
total system is analyzed in view of program- 
ming requirements to determine whether the 
design is in all respects acceptable. Possible 
imbalances in the service or in the amount 
of equipment can result in a further analysis 
and a change in the equipment complement. 

An analysis was performed at Ramo- 


Wooldridge similar to this and was reported 
by Rothman^; the results are shown in 
Figure 6. The problem mix gives the 
characteristics of the problem. For example, 
50% of the problem required servicing 
requests which had to be completed on an 
average of 0. 5 minutes and with a total 
processing time of 1. 5 minutes. Poisson 
distributions were assumed and the frequency 
of arrivals is given essentially by the 
abscissa in terms of the average number of 
requests arriving translated to the number of 
computers to handle this average load. The 
zero per cent curve shows that if the number 
of computers equals the average load, then 
there is a zero probability of servicing the 
problem mix. As an excess of computers is 
applied, service improves as the results 
indicate. 

In many cases, however, the queuing 
theory approach is not applicable. There is 
a need for a more general approach to 
evaluating the applicability of a computer to a 
given problem. What is sorely needed then is 
a technique for developing the important and 
significant characteristics of problems and, 
likewise, the important and significant 
characteristics of computers. 

The general thought is portrayed in 
Figure 7 where, starting with the general 
problem characteristics and the general 
computer characteristics, a problem model 
of general applicability and a computer model 
of general applicability, results. (The author 
is indebted to S. Rothman of Ramo-Wooldridge 
for stimulating discussions on the subject.) 

If a specific problem is described in terms of 
this general problem, and a specific computer 
is described in terms of this general computer 
module, the theoretical performance of that 
computer involving the particular problem 
would be obtained. If the model were deemed 
sufficiently valid, the results then at this 
point would be useful. In the development of 
the methodology however, it would probably be 
desirable to compare the theoretical 
performance with the actual performance, 
providing that the specific problem had been 
already solved by a specific computer. By 
making this comparison one determines what 
problem model changes and what computer 
model changes are necessary. These then can 
be sent back to improve the problem model 
and the computer model. 
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Conclusion 


That there has been little progress in 
computer system design or organization stems 
from the fact that computer technology is 
relatively new. That excuse, of course, ages 
rapidly. A contributing factor is that most 
commerical computer manufacturers are 
inherently conservative, and see increased 
circuit speed as the only solution. Unfortu- 
nately other groups which are in a position to 
promote progress, have not always been 
sufficiently perceptive . However, "right is 
might", and the fruits of these approaches 
will most likely become increasingly manifest. 
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Summary 

The main objectives of this paper are to 
present a multi-level file structure for natural 
or textual language, and a computer algorithm which 
logically associates elements of the presented 
file structure. These objectives are met by 
showing how the formalized data is mapped into a 
vocabulary that is amenable to automatic computa- 
tion. This vocabulary consists of a set of in- 
dices which retain the syntactual association of 
the inputs. These sets of indices constitute a 
file organization which employs an associative 
list- type structure of the type utilized by 
Newell, Simon and Shaw in their Information Pro- 
cessing Language. 1 

The problems of natural language input and 
how they are surmounted are described. The de- 
tailed multi-level file structure can be used 
with systems whose input includes textual language 
that may be structured or formalized into an 
"attribute-attribute value" format, whose accuracy 
and/or reliability is questionable, and which is 
received in disjointed elements that have to be 
synthesized into meaningful associations. 

I. Introduction 

This paper is concerned with an application 
of digital computers to the field of information 
processing. In particular, we shall restrict 
ourselves to the processing of highly formalized 
information specified in a natural language, 
exhibiting characteristics which prove it a source 
of data for automatic development. Although the 
automatic development of information is restricted 
to formalized data alone, information of a non- 
formalized nature which relates to the formalized 
data can be retrieved also by these techniques. 

In the first portion of the paper, a multi- 
level file structure to accommodate the process- 
ing of the information is developed. An appli- 
cation utilizing this file structure is then made 
to illustrate its usage in the retrieval and 
association of data already existing in the sys- 
tem. 

The work reported on in this paper was 
sponsored by OACSI, the Office of the Assistant 
Chief of Staff for Intelligence, Hq. Department 
of the Army* The work is part of a study nick- 
named Project ACSI-MATIC, concerned with the 
potential uses of modern data- processing equip- 
ment and procedures in the activities of certain 


headquarters military intelligence operations of 
the U.S. Army. 

The nature of the problem can best be char- 
acterized by describing it as one in which the 
processing of free language plays an extensive 
role. This is a consequence of the fact that the 
basic input to the system is in the form of re- 
ports and documents in which pertinent events and 
conditions have their statement largely in a nar- 
rative language. In the computer approach which 
has been proposed as a solution, this narrative 
characteristic of the input is formalized in or- 
der to take advantage of the computer 1 s ability 
to process information. 

A prerequisite for the utilization of this 
ability of the computer is the necessity to elicit 
formalized information from the input. The sub- 
stance of the information in the input was sub- 
jected to a detailed investigation to determine 
whether or not a formalized structure could be im- 
posed upon it. A set of formal categories was 
determined in the consideration of the body of in- 
formation constituting input to the system. These 
categories are descriptive of the major elements 
which the system encompasses and have relation- 
ships among themselves which reflect similar re- 
lationships between the elements which they repre- 
sent. The elements enter into relationships of 
synonymity and hierarchy which the system must 
establish and confirm. 

The specific set of categories chosen are 
termed descriptors of the system. A particular 
instance in a category or descriptor class has 
been termed a descriptee . There is a set of des- 
criptor classes which has the special feature that 
a single descriptee in any one of these descriptor 
classes uniquely identifies a major element of the 
system. Any descriptee from a descriptor class in 
the special set has this property. An associative 
relationship can be established for any set of 
descriptees in these special classes which denote 
the same element. The special descriptor classes 
having this feature are called biunique descrip- 
tors, and the descriptees included in them are 
called biunique descriptees . 

With the descriptor classes explicitly 
formulated, the descriptees can be recognized and 
extracted from the input documents. An input 
specialist then assembles these descriptees into 
modules called multiple ts which form the basic 
source of material for automatic processing in the 
system. These multiplets are stylized aggregates 
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of descriptees wherein the descriptor classes 
represented provides an index to the special as- 
sembly of subroutines in the internal system which 
the multiplet requires in order to provide for its 
full development* This is a consequence of the 
fact that relationships between the descriptees in 
the multiplet have been predetermined and are de- 
pendent upon those descriptor classes represented, 

A typical multiplet of information to the 
system would be as follows t (D, xj_; G, X2; L, X3), 
It reads that the descriptee of the descriptor 
class D is related in a unique way to the des- 
criptee X2 of the descriptor class C and is loca- 
ted at X3 (the location descriptor class is indi- 
cated by L), 

When the multiplets of desciiptees are ex- 
tracted from an input document there usually re- 
mains a residue of information which has perti- 
nence to the problem, which must enter into the 
system. These fragments, as they are called, do 
not participate in automatic processing within 
the system, but serve as a background against 
which the associated multiplets can be better de- 
veloped, The associated multiplet provides a co- 
ordinated index to this fragment, A basic feature 
of the system is this relationship between multi- 
ple t and fragment and the manner in which it is 
preserved. 

As an introduction to the file structure, 
the automatic development that takes place with 
respect to an input multiplet will be briefly dis- 
cussed, There are two stages of development of 
the multiplet. The first stage develops and 
standardizes the descriptees within the multiplet, 
independent of the other descriptees. The second 
stage utilizes the specific combination of ele- 
ments comprising a multiplet. 

The first stage of development is best il- 
lustrated by the following simple example: The 
desired machine input when locations are mentioned 
might be coordinate, city, and state, if a city is 
mentioned. However, inputs to the system might 
contain a coordinate and city without mention of a 
state. It is therefore necessary to develop the 
descriptee and standardize it before further pro- 
cessing commences so that at a later date one can 
retrieve the information under the hierarchic lo- 
cation term “State*', Unless the data is initially 
developed before decisions are made about it, both 
information processing and information retrieval 
might be difficult and time consuming, if not im- 
possible, The adequacy of information retrieval 
is to a large degree determined in the input stage. 

After initial development and standardiza- 
tion of the information contained within the mul- 
tiplet, it is then possible to process the data' 
against the existing information. The system 
logic, the content of the files, and the input in- 
formation define the processing that takes place 
within the formalized files. In a sense each 
multiplet determines its own processing through 
the system. The objective of processing 


multiplets is to categorize the information with 
respect to all other information within the files. 
To categorize the information a substantial amount 
of retrieval and association of data is often re- 
quired. Another portion of the system logic 
specifies whether or not attempts should be made 
to search for, retrieve, and determine associa- 
tions between the input item and the file infor- 
mation. If it is so specified and associated in- 
formation exists, the machine logic augments the 
input multiplet with the associated data and at- 
tempts to categorize this new multiplet (testing 
for consistency, completeness, etc.). Rules are 
specified whereby the machine can automatically 
resolve certain situations of a general nature. 

In clearly defined and precise situations the ma- 
chine simulates man's intelligence. 

Throughout the system man plays an important 
role in analysis and decision making required 
with respect to the data. Based upon the cate- 
gorization of the information the man must speci- 
fy what the ultimate conclusion of the machine 
processing should be. At the present stage of 
development the machine logic only determines the 
consistency consequences of decisions, but it is 
dependent upon the man for the final disposition 
of the results. 

The ability of the system to logically as- 
sociate information is important for two reasons. 
In the first place, by associating the informa- 
tion in the multiplet with other multiplets, it 
might be possible to relate terms that previously 
did not appear to be related giving man the capa- 
bility to retrieve information that he would not 
have been able to obtain had the multiplet been 
disassociated. Secondly, forming associations is 
useful for systematically purging data from the 
file of information to automatically retire data. 
This is an effective way to keep a dynamic system 
from growing beyond the bounds of computer stor- 
age media. Retirement of outmoded data is the 
only long-range effective means of maintaining an 
efficient system. 

II, The Multi-Level File Structure 

The multi-leveled file structure accommo- 
dates information at various levels of formaliza- 
tion and authenticity. The more highly organized 
information is separated from the incomplete or 
unauthenticated data and is available as a guide 
to the other information. One system objective 
is to elevate information from the unauthentica- 
ted class to the authenticated class. The abil- 
ity to form associations facilitates this func- 
tion. This structure can be generalized to ac- 
commodate systems where information entering it 
has the following properties r firstly, it can 
be formalized in the sense described above ? sec- 
ondly, it is questionable in regard to its ac- 
curacy and reliability; and finally, it has the 
property of being received in disjoint elements 
which have to be synthesized into meaningful 
associations that will yield maximum information 
content. The following description, although 
oriented towards one particular problem. 
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represents a file structure applicable to a broad 
class of systems. 

Hie multi-level file structure has these 
components t 

1. Glossaries 

2. Index Files 

3. Buffer Files 

U. Information Files 

a. Authenticated Files 

b. Intermediate File 

The Glossaries are a series of files which 
serve first to standardize and develop the various 
alternatives which occur for descriptees associa- 
ted with certain descriptor classes. Certain 
descriptor classes will have Glossaries associated 
with them. The incoming multi pie ts are initially 
processed into the system through these Glossaries 
and are rendered into a vocabulary amenable to the 
computing system. In addition to this function 
the Glossaries may serve as Index Files (index 
Files are explained below). Glossaries are or- 
dered in a natural sequence depending upon the 
class. 

Each descriptor class has an Index File as- 
sociated with it unless a Glossary is utilized in 
this capacity. These files are essentially lists 
of descriptees segregated into their respective 
descriptor classes. Index Files represent a 
second level of file organization and are uti- 
lized as directories to the Information Files, 
that is, to authenticated multiplets in the Au- 
thenticated Files, and to the unauthenticated 
multiplets and the non-formalized data in the In- 
termediate File. The latter is accomplished by 
associating with each descriptee a machine address 
wherein this information is located. This machine 
address constitutes part of the Buffer Files to be 
explained shortly. Each Index File is ordered in 
a sequence natural to it (alphabetic, numeric, 
etc.). The format of an entry in an Index File is 
(descriptee; reference to a Buffer Address). The 
subset of Index Files relating to descriptor 
classes containing only biunique descriptees is 
termed the Biunique Descriptor Index File and will 
be utilized later in the paper. 

The third level of files are the Buffer 
Files . To comprehend this file it will be neces- 
sary to specify what happens to a message when it 
enters the system. With every such message is 
assigned a unique machine address location, termed 
an index. This index refers to the location where 
the multiplet, auxiliary data, and -the non- formal- 
ized textual information (fragment) is retained? 
that is, the address of the item in the Inter- 
mediate File explained below. If as a result of 
processing, (either by man or machine), the in- 
formation is authenticated, the machine location 
of the authenticated information is also noted. 

The index, or the machine address of the multiplet, 
is placed in the Buffer File portfolio of each 
descriptee occurring in the multiplet. 

To be more explicit, the format of an entry 
in one of the Buffer Files is: (index? L]_, 12). 


As has been stated, the index relates to an Infor- 
mation File address of the multiplet. This is 
actually an address in seme storage medium; pre- 
ferably random access. Hie symbols L-, and 1^ 
represent links to the descriptee portfolio. This 
portfolio or list is maintained in order according 
to the index number. The link Ln , associated with 
the first index in Ihe list, is the address of a 
previous index in the descriptee portfolio; the 
link L2> associated with this index, is in turn 
the address of the next index in the portfolio. 

The list of indices is cyclic, i.e. the last in- 
dex in the list is linked to the first index. 
Therefore it is possible to retrieve an entire 
portfolio of indices by entering at the first ad- 
dress as provided by the particular descriptee, or 
more important, by entering at any point on the 
portfolio chain. Furthermore, the chain need not 
be given sequentially; it may be scattered through- 
out the file. This last fact permits a most im- 
portant and useful file structure, namely all in- 
dices within a descriptor class of the Buffer File 
can be sorted in ascending order with the index as 
a key. The actual sort may be accomplished by 
either listing these indices in sequential sorted 
order or by listing in random order using two 
links L3 and appended to the index. This file 
structure obviates a great deal of search time 
since the relative position of an index may be de- 
termined within a small area of the Buffer File 
and only this area need be searched for a particu- 
lar index. 

It should be emphasized at this point that 
in contrast to data of variable length contained 
within the Index Files, the entries of the Buffer 
Files are of fixed length. This allows the Buf- 
fer File entries to be utilized easily when re- 
quired in processing. As will be shown subse- 
quently, considerable usage is made of the indices 
contained in this file during the processing and 
association of data relevant to incoming messages. 

Hie Information Files are a set of files 
which contain modules of associated descriptees 
and related information wherein the associations 
satisfy certain specific system criteria. These 
files represent the main repositories of succinct 
and organized information about the descriptor 
classes of the system, and, as such, are the 
principle cource of data for the compilation of 
special summaries and for replies to interroga- 
tions. 

One system goal is to determine the authen- 
ticity of information. This has dictated a defi- 
nition of the Information Files into two distinct 
files? Hie Authenticated File (AF) and the 
Intermediate File (IF), if a multiplet is estab- 
lished (by some means) to be authentic, it is 
placed in the AF; otherwise it is placed in the 
IF. The Authenticated File contains, in addition 
to the authenticated multiplets, certain auxiliary 
information such as dates* 

The Intermediate File is the file into 
which is placed the entire input message, composed 
of multiplet, fragment, and any auxiliary 
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auxiliary information peculiar to the system 
(e.g., date of receipt of input, source of input, 
etc.) when the information has yet to be authenti- 
cated. The location where this input message is 
placed in the storage medium provides the index 
for use in the Buffer Index Files, as explained 
before. 


Usage of the Multi-Level File Structure 


The purpose of this section is to illustrate 
the usage of the multi-level file structure to 
determine associations that may exist between 
multiplets residing within the system. Suppose 
that it is desired to examine the textual material 
within the fragments in the Intermediate File re- 
lating to some descriptee or combination of des- 
criptees which appear in the Index Files. Given 
each descriptee, it is possible to retrieve its 
portfolio of Buffer File indices and utilizing 
some combination of the Intermediate File indices 
contained in these portfolios (e.g., their union 
or intersection) it is possible to retrieve the 
desired textual information. The file structure, 
although oriented towards formalized data, can be 
utilized to retrieve non-formalized data. The 
technique for retrieving formalized data from the 
Information Files is identical. 


The process of developing information to 
the extent that a more inclusive fact can be de- 
duced from more than one inclusive fact has been 
termed "cycle and string formation"* The process 
of determining cycles or strings depends upon 
whether or not biunique descriptees are contained 
in the raultiplet. Furthermore, the method re- 
mains applicable if more biunique descriptors are 
added to the system other than originally contem- 
plated* 

Cycles and strings are defined as follows: 
Two multiplets are said to be logically associated 
if they have at least one biunique descriptee in 
common. Then, if the multiplets can be ordered 
such -that the first is logically associated with 
the second which in turn is logically associated 
with a third, etc., a cycle of n constituent 
multiplets exist if the nth multiplet is logical- 
ly associated with the first. In a cycle a par- 
ticular multiplet is logically associated with at 
least two multiplets: the preceding and follow- 
ing multiplets. If n multiplets may be logically 
associated in a manner where a particular multi- 
plet is logically associated with only one other 
multiplet, a string of n constituent multiplets 
exist. ~ 

An example of each is shown in the figure 1, 
where the ABGD multiplet is logically associated 
with the ECFG multiplet thrcxigh the descriptee C. 
In like manner the EGFG multiplet is logically 
associated with the HFKL multiplet through the 
descriptee F. Likewise the HFKL multiplet is 
logically associated with the BHIJ multiplet, 
which in turn is associated with the ABCD multi- 
plet through the descriptee B. Notice that the 
loop BGHi is indeed a cycle in the graph- theore- 
tic sense — hence, the name. If the BHIJ multiplet 


were absent, the string BGF would be defined. 

Whenever cycles and strings are determined, 
the constituent multiplets describe the same sub- 
ject. It is therefore possible to integrate the 
multiplets into one multiplet. ill of the des- 
criptees in these constituent multiplets are not 
necessarily required for cycle and string forma- 
tion. It is possible for biunique, as well as 
for non-biunique descriptees of one multiplet, to 
contradict the corresponding descriptees in 
another multiplet. Those descriptees of multi- 
plets which are not required for cycle and string 
formation are termed the residual cycle or resi- 
dual string. 

Cycles and strings are to be formed using 
all multiplets relevant to a particular input 
multiplet. It is necessary that all multiplets 
relevant to the inpit raultiplet be retrievable 
or ire cons true table. In actuality, this is re- 
quired only for the biunique descriptees of the 
multiplets. To retrieve these multiplets di- 
rectly is time consuming, particularly since 
there may exist many levels of relevant multi- 
plets. To reconstruct the biunique portions of 
these multiplets is the interesting approach. 

It will be shown that this can be accomplished 
exclusively via the Biunique Descriptee Index 
File and Buffer Files. 

No matter how cycles and strings are deter- 
mined, the complete technique is a three phase 
operations 

Phase 1. Retrieve all multiplets relevant 
to a particular multiplet 

Phase 2. Form all possible cycles and 
strings 

Phase 3. Validate these cycles and strings. 
These phases shall be discussed in order after 
introducing the notation to be used. 

Assume first that all multiplets relevant to 
the input message have been retrieved. Let there 
exist a total of n biunique descriptor classes 
in the retrieved multiplets. Then for the jth 
biunique descriptor class let t j be the total 
number of biunique descriptees of the jth class 
contained in all these multiplets. Denote the 
ith descriptee in the jth descriptor class to be 
Dj_jj (l£ i^tjj lij^n). Now it is possible 
that a particular descriptee occurs in more than 
one multiplet. The list of indices to multiplets 
containing the descriptee D^ is denoted Ljj 
(l£ i^tjj l<s- j-?n). These lists of indices are 
the desired output of Phase 1 which is the input 
to Phase 2. 

The technique of retrieving these lists, 
which will be used in reconstructing the biunique 
descriptee portions of all multiplets, is ac- 
complished as follows: 

1, Examine the Index File for those bi- 
unique descriptees contained in the input multi- 
plet. For those that exist, retrieve the associ- 
ated Buffer File portfolios or lists of indices. 
Each portfolio will be found in that section of 
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the buffer file corresponding to the descriptor 
class of the associated descriptee. Add the ii>- 
dex of the input multiple t to these lists tagged 
with the appropriate Ly • 

2. Starting with the first index of the 
first portfolio, “ search" the other sections of 
the Biunique Descriptee buffer file for this in- 
dex. This required search utilizes the relative 
magnitude of the particular index and the sorted 
order of each section of indices; therefore, the 
,, search ,, time is comparable to “function table 
lookup” time. When an index is found, retrieve 
that portfolio in which it is contained unless it 
was retrieved earlier. Tag these portfolios or 
lists with the appropriate Ly . Since there can 
be only one descriptee for a particular descrip- 
tor in a given multiplet, the same index cannot 
be a member of more than one portfolio in any one 
descriptor class section of the buffer file. 
Therefore, there is no need to “search” the same 
section of the buffer file for indices contained 
in an associated Ly. 

Repeat this procedure for all port- 
folios determined. 

3. When all portfolios have been processed 
as in step two (2), all lists Ly of indices to 
relevant multiplets have been determined. Retain 
these lists as they are but merge all of the in- 
dices into increasing sequence, eliminating dup- 
lications to form a list, S of p indices. Order 
this list in increasing sequence. 

The required merge utilizes the sorted 
order of all relevant portfolios. 

U. For each descriptee list, 

Ly; (1^ i£ tj; 1 £ j £ n), form a corresponding 

binary number L* of p bits whose qth bit is 1 if 
id 

and only if the qth index in S is contained in 
Ly . Now consider the binary numbers Ly 
(1^ i— t^; 1- j-n). An examination of these 

n 

m = S t ; binary numbers placed such that 
j = l J 

their qth bits are in vertical alignment reveals 
a reconstruction of the biunique descriptee por- 
tion of that multiplet whose index is the qth 
element of S. Multiplets are reconstructed 
vertically and all multiplets containing the same 
descriptee are represented horizontally. 

This completes the first phase of the cycle 
and string process; that is, all multiplets rele- 
vant to a particular multiplet have been retrieved; 
furthermore, the biunique descriptee portions of 
these multiplets have been reconstructed. 

Phase 2, that is the formation of cycles 
and strings, will be accomplished by a logical 
product operation on the binary numbers L] . and 
the result used as a mask on S. The operations 
employed are now defined. 

a. Logical product, denoted by A , pro- 
duces from two binazy numbers a binary number 
whose qth bit is zero if and only if one or both 


of the qth bits of the two binary numbers are 
zero. 

b. Masking, denoted by M, produces from a 
list of p elements, S, and a binary number of p 
bits, a list of elements from S' which are in the 
same relative position as the non-zero bits in 
the binary number. Thus, for example, if we let 
the binary number mask S we obtain the list 
of indices Ly. Notati. onally this is written: 

Ly MS — L ij 

Then, the actual determination of cycles 
and strings is as follows: 

5* Perform: 

( L y A L^. s ) M S — ► S (i,j;r,s) for (l) 

(l^ritgj j+l*s*n; l*i£t y, l^j<n) 

where S(i,j;r,s) are lists of indices of all 
relevant multiplets containing both the ith des- 
criptee of the jth descriptor class and the rth 
descriptee of the sth descriptor class. 

6, If the results of the qth product-mask 
operation (1) are non-vacuous, S(i,j;r,s) is 
stored and its address is placed in the qth posi- 
tion of a list H, capable of holding 

n. n. 

2t K ( 2 ) 

j=l k =J+l K J 

memory addresses. If the results of a logical 
product is empty (no multiplets exist which con- 
tain both descriptees Dy and D rg indicated by 
(1) ), store a word of zeros (or some symbol) 
indicating this fact in the list H. Let the 
first element of H be h. 

Before continuing the algorithm, it is 
necessary to present the reasons for developing 
the list H and indicate generally how this list 
is used. Cycles and strings are determined by 
choosing a fixed descriptee for each descriptor 
class. If at least two of the chosen descriptees 
exist in one or mere multiplets, these multiplets 
are eligible to be constituent multiplets of a 
cycle or string. From this class of eligible 
multiplets, a cycle or string is formed depending 
upon which multiplets can be logically associated. 
All cycles and strings are determined by repeat- 
ing this process for all descriptee sets formed 
by choosing one and only one descriptee from each 
descriptor class. Since there are ty descriptees 
for the kth descriptor, each one of these des- 
criptors must be considered in combination with 
all of the other descriptees. Thus, there exists 

a 

II t j sets I v , which must be examined for 
i ~ i 

cycles and strings. Every set II.; (l£ v * n ty) 
n(n-l) j=l J 

has at most 5 sets of non-vacuous relevant 
indices S(i, j ;r,s). In all probability, these 

subsets will not contain the same number of items 
(or indices) for all values of (i,j) and (r,s). 
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One function of the list H is to obviate the 
search for particular subsets of indices 
S(i,j;r,s) in cycle and string formation. 

The existence of cycles and strings is de- 
termined in part by knowing if at least two bi- 
unique descriptees occur in at least one multi- 
plet* It is unimportant what the multiple ts are. 
In other words, cycle and string existence is de- 
termined in part by noting if the sets S(i,j ;:r,s) 
are non vacuous. The list H contains the ad- 
dresses of the non- vacuous sets S.(i,j;r,s ), and a 
••Void ’ 1 symbol for the vacuous sets of relevant 
indices. Therefore this list of constant size 
items is used to determine cycle and string exis- 
tence instead of the list of indices S(i,jjr,s ) 0 


It is noted in passing that this list of 

addresses could contain the binary numbers 

L . . A L instead of the addresses of the sets 
rs 

S(i,;jjr,s) obtained after masking the list S of 
all relevant indices by the particular binary 
number. Then the "void*' symbol is binary zero. 

7. Now continuing the algorithm, the 
existence of cycles and strings is determined by 
examining the addresses of the vth set of rele- 
vant indices I v . The elements of I v depend upon 
which particular descriptees are chosen for each 
of the n descriptor classes. Then, for the jth 
descriptor, a particular descriptee shall be 
termed D 3 . ,-jJ (l^ij-t^j 1- j -n). The set of 


address of I_ ( 1 ^ v ± ri t^), contained in the 
j = l J 

list H, is determined in a systematic manner by 
computing for j*l* 


|"(h-l)+ ij St y - St 2 + i s ] 
L Vj+‘ 2:iS J 


and for j > 1 t 


[(h-l)^S t w 2 t x 

L ul>= I x = tw-*i y = 1 a=s J 


( 3 ) 


for the range of values: ( J+l- s — n; 1 — j < n) and 


8 . Then, for the addresses determined for 

each I v ( 1 - v - nt^i 
j=l J 

A cycle exists if there exists a sub- 
set of these addresses such that every cor- 
responding (i,j) and (r,s) in I v appears at 
least twice. 

A string exists if the cycle criterion 
holds for all but two and only two of the 
(i,j) and (r,s). 


The constituent raultiplets of the cycle or 
string are in both cases those raultiplets whose 
indices are contained in the sets S(i,j;r,s) with 
the restriction that only one (but any one) index 
be selected from each set. If there exists more 


than one element in a set S(i,jjr,s) there exists 
more than one set of constituent raultiplets which 
form the same cycle or string. This completes 
the second phase of the cycle and string forma- 
tion algorithm* 

After a cycle or string has been determined, 
its authenticity is established in the phase 3 of 
the algorithm. This may be accomplished in part 
by examining the fragmentary information (dates, 
sources of information, etc.) associated with the 
constituent multiple ts* Furthermore, the validity 
of a cycle is strengthened by the non-existence 
and/or the resolvability of residual descriptee 
contradictions. Other tests of authenticity may 
be devised on the basis of multiplet content or 
information contained. If a cycle or string and/ 
or its associated integrated multiplet passes all 
validity tests, it must be processed through the 
system logic to determine Authenticated File con- 
tradictions. The final disposition of a cycle or 
string and its constituent raultiplets is dependent 
upon the particular application. 

It should be noted that all cycles and 
strings are determined by the given technique af- 
ter all relevant raultiplets have been identified 
and then reconstructed; thus, the first phase of 
the technique may be used as a device to retrieve 
all relevant raultiplets. More important, notice 
that the only time the narrative information was 
used occurred in determining the existence of bi- 
unique descriptees in the input information and 
in the final authentication of an existing cycle 
or string. Otherwise, all of the presented pro- 
cessing of information in cycle formation is ac- 
complished by use of only the indices in the Buf- 
fer Files. Thus, an important characteristic of 
the presented file structure is the abiliiy to 
associate information, accomplish certain types of 
processing, etc., using the fixed format indices 
instead of the narrative form of the information. 

IV. Conclusions 


It has been shown how the elements of the 
proposed information- processing system have been 
formalized in order that they can be incorporated 
into a machine system. A file structure has been 
established to encompass these formalized elements 
to provide for their full development. The for- 
malization of elements receives its ultimate ex- 
pression in the establishment of the Buffer File 
indices which have a fixed- field format, and thus 
lend themselves readily to automatic manipulation. 
These Buffer indices with the associated des- 
criptees provide a powerful tool not only for 
automatic manipulation of data, but also in its 
retrieval for manual manipulation* 

The stylization of the input data, as was 
shown, is but the first step in the complete 
formalization of information within the system* 

The ultimate consequence is the emergence of the 
multi-level file structure which it is believed 
will facilitate the development of the desired 
information. This is accomplished in a file 
structure that accommodates a variety of 



information in different stages of development 
and authenticity* Also, it is believed that in- 
coming information has been efficiently factored 
into terms which provide a full key which has 
reference back to the incoming data* These terms 
(essentially descriptees) become entries into the 
Index Files, The mechanism of the Buffer Indices 
insures a facile method for the reconstitution 
of these terms into aggregates which may even be 
more organized and revealing than those found in 
the original input. 

The manual aspects of the proposed system 
should not be overlooked. One of the manual fea- 
tures arises due to the complexity of the infor- 
mation involved; i.e. it may occur in one of the 
processing routines that an impasse arises because 
of unresol vable contradictions or lack of infor- 
mation, at which point, the machine signals for 
manual intervention. It may occur that the man is 
able to resolve the difficulty without further ac- 
cess to information within the machine, The full 
textual information in the form of fragments rele- 
vant to the situation within the Intermediate File 
is available for resolution if necessary, and is 
supplied to the man through the various interro- 
gation mechanisms available. In addition, every 
automatic process has decision points at which the 
process may signal the alternative it has chosen, 
and if the man disagrees with it, he may inhibit 
this alternative and pursue his own. It is be- 
lieved that the proposed system achieves a balance 
between a man and a machine, allocating to each 
the duties for which they have the best capability. 
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Figure 1» Representation of Cycles and Strings Formed by Multi plets 
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Introduction 

Not long ago, information activities were still 
considered merely incidental to the technical 
operations of industrial concerns, while such 
activities are now fast becoming a tangible 
portion of company efforts. The problems of 
information storage and retrieval increase even 
while solution methods are being studied. There 
is an urgent need for new techniques which will 
enable clerical help with machines to carry out 
searches, thus liberating engineers and scien- 
tists for work on technical problems. These, in 
turn, include the problems of documentation re- 
search . 

In order to organize the use of recorded knowl- 
edge, it is obviously desirable to: 

1. Classify all information to be stored; 

2. Correlate all facts to guide future 
dec i s i ons ; 

3. Select recorded information relative to 
a given problem. 

The development of a large-scale retrieval sys- 
tem will largely depend on the results of work 
in the classification of concepts rather than of 
words. There is little sense in classifying 
words to organize information because the irra- 
tional, natural language of a document is hardly 
ever the best systematic language for use in a 
machine. This difficulty is a serious obstacle 
to the accurate correlation of recorded facts 
and, by extension, to phrase-by-phrase trans- 
lation from one natural language into another. 
Since natural languages are rather inefficient 
tools for the symbolic expression of ideas, the 
methods of symbolic logic can be employed to 
analyze, formulate, and correlate these ideas be- 
fore they are used, "couched in the capricious 
imagery of words", so to say. 

Making use of the symbolic logic truth-matrix 
computer technique discussed in earlier papers 
I, 2, and of other logic methods, it is possible 
to construct decision-devices useful in the clas- 
sification, correlation, and selection of docu- 
ments, as well as in the machine translation of 
languages. The basis of the symbolic logic ap- 
proach is non-arithmetical . It features a very 
flexible matching ability. The approach, general 
with respect to machinery, does not suffer from 
the handicap of a searching program written ex- 
pressly for an existing special-purpose machine. 
The latter may operate by quas i -mathemat ica I 


processes, and entail long stretches of "house- 
keeping" operations. 

Another important consideration in the planning 
of an information retrieval system is its adapt- 
ability to expansion. It is one of the purposes 
of the symbolic logic methods to provide for the 
eventual embedding of the present system in one 
of far greater scope, preferably in nesting- 
block fashion. 


The approach taken is not to be confused with 
one limited to the use of the logic of collec- 
tions of messages without regard for meaning, 
it is here intended to do more than merely cata- 
log the frequencies of musical tones in hopes of 
having someone perceive a melody. 


The Proaressive Staaes Of Loc 


The techniques of mathematical or symbol ic logic 
which underlie the proposed approach to documen- 
tation research will be discussed only insofar 
as they go beyond the rudiments of this disci- 
pline as given in textbooks on symbolic logic or, 
in part, in I, 2. It is the object of the pres- 
ent paper to show how symbolic logic, as a tool, 
can be introduced gradually into the problems of 
information storage and machine searching and 
translation, just as the scope and the flexi- 
bility of the system developed are gradually in- 
creased . 


Seven two-propositional functors and 
denial (Figure I) are used to analyze 
the relationships of a body of infor- 
mation and to separate out various 
classes of factors. The relationships 
thus brought out might go unnoticed in 
the confusing form in which information, 
such as a large group of observational 
data, or a set of statements in a nat- 
ural language, is usually available. 

For instance, a chemist may encounter 
this: 

A white crystalline solid contains ni- 
trogen. Its melting point is II 4° and 
the solubility division is W. The aque- 
ous solution is neutral to litmus. 

When the solid is heated, it distils at 
222°; water is evolved during this dis- 
tillation. The distillate reacts with 
nitrous acid to form a water soluble 
acid that boils at 118°. 
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A logic statement corresponding to this 
description may be given as: 

(see Figure 2 for proposition-key) 

KKKNaKcNdCcPab 

KNge 

KKKi jkl 

Kqr 

KPonm 

Kuv 

K^z 

Note that for propositions k,l,m, and z 
table- lookup is indicated 

2. By means of the basic two-propositional 
functors, the truth-matrix technique of * 
and a method of encoding and matching 
through the characteristics (truth- 
values) of statements, a convenient and 
versatile basis is provided for search- 
ing, sorting and selecting operations, 
(see example below) 

3. Group-theoret i ca I relations^ among the 
character i st i cs can be used to simplify 
considerably the making of decisions. 
Tedious operations commonly performed by 
Boolean algebra are thus paralleled by a 
binary matching technique which makes 
possible the ordering of otherwise very 
unwieldy clusters of statements. 

Through this technique, simplification 
and normalization (i.e. transcription 
into the form using only and, or, not) 
become automatic. 

All the characteristics of a given num- 
ber of propositions form a finite 
Abelian group with respect to the opera- 
tion A. By comparing character i st i cs 
through the equivalence operation - I 
if a pair of bits is alike, 0 otherwise- 
it is possible to have the machine find 
group elements which have the simplest 
symbol equivalents. 

e.g., the expression 

VKVNPabcNVVNSabNSbcNSacKKabc 
has characteristic 01 I 0 I 0 0 I 
(Figure 3) 

Testing the characteristic of a: 

01 I 0 I 0 0 I characteristic 

0 0 0 0 1 I I I a 

1 0 0 I I 0 0 I by E-(equ iva lence) 

operation 

01 1001 10 inverse, (by N-opera- 

tion) this indicates 
the presence of the A- 
operat i on 

01 1001 10 bAc 

I I I I I I I I tautology proves this 


Hence, the expression simplifies to 

aA(bAc) 

or AaAbc 

or, in normal form, (since Axy is the 
same as VKNyxKNyx) 
VKNVKNcbKNbcaKNaVKNcbKNbc 

4. Probabilistic weighting factors can be 
admitted to provide for the statistical 
handling of information. For example, 
they could serve for machine-decisions 
concerning pertinency ratings of docu- 
ments retrieved. 

For example, a materials engineer may 
make the following request: What rubber 
material should be used in a certain 0- 
r i ng packing exposed to organic phosphate 
ester hydraulic fluid on the inside sur- 
face and atmospheric oxygen on the out- 
side surface, and able to withstand 225°F 
hydraulic system temperature and mechan- 
ical abrasion. 

There are perhaps 25 faipi lies of rubber 
materials against which to match the re- 
sulting assertion pattern. Some of the 
required properties may be obtained more 
generously than others from any given 
rubber material. It may be necessary to 
establish a preference order among dis- 
closures, since it would be desirable to 
obtain not only a material just fulfill- 
ing the requ i rements , but the best possi- 
ble one for the purpose at hand. The 
task may be carried out as outlined in 
Figure 4. This example may serve to il- 
lustrate the high degree of specification 
attainable in this type of machine search- 
i ng. 

5. In order to achieve more freedom in 
translating diversified and intricate 
statements in a natural language into 
machine language, logic operators, ^ 
or quantifiers (Figure 5) could be 
called in. 

These quantifying operators could be 
introduced as special notes by which to 
evaluate properly the results of a truth- 
matrix analysis. Alternatively, the 
quantifiers could be approximated, for 
purposes of computation, by prearranged 
threshold values which would then be used 
like probabilistic weighting factors 
(above) . 

The functional relationship of information re- 
trieval systems to arithmetic compiler languages 
may be noted here. Documentation work essentially 
deals with patterns of thought and their relations 
to specific units of recorded information. While 
it may be undesirable to mold work in such a dis- 
cipline on a technique of programming mainly 
arithmetical processes, it may nevertheless be 
profitable to uphold compilers as good examples, 
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philosophically speaking. Compiler languages, 
such as Fortran, Speed, etc., are representat ions 
at the operator, or function, level. The tran- 
scription of documents, instead of being done on 
an elementary symbol -by-symbol basis, without 
reference to context, should also graduate to the 
function level. The relationships among the clue- 
words of a document should be inherent in the 
record . 

The Progressive Stages Of Information Handling 

Before proceeding with the application of symbol- 
ic logic, one may review the need for it. 

The point of departure for all documentation re- 
search can be said to be the traditional library 
index. The first step in this work is due to the 
need to go from the fixed and little expandable 
subject categories of the library index to flexi- 
ble categories of information such as are provided 
by many special libraries today. 

In multi-aspect indexing, documents are stored 
under groups of clue-words which characterize 
their contents, and retrieval is effected by re- 
quests made up of any combination of clue-words. 
This is a long step ahead and away from fixed- 
category filing. 

It now appears that the machine runs into a snare 
on retrieval. If all the clue-words pertinent to 
a document are combined into one large group, 
there will be false drops due to lack of resolu- 
tion. If, on the other hand, a document is in- 
dexed only under a few clue-words, it may be over- 
looked on many requests. That is to say, much of 
the advantage of the system over traditional in- 
dexing is then lost. 

This is the central quandary of information han- 
dling. Many authors have attacked the problem in 
many ingenious ways, and some of them have suc- 
cessfully moved far beyond that stage, 5, 6, 7, 8, 9, 
to find an optimum solution. 

One trend is thus toward more detailed indexing, 
and another toward simplification. Moreover, 
these trends are also tied in with machine speeds 
and machine memory capacities. Consequently, most 
of the new work in documentation research is cen- 
tered at the intersection of these two trends. 
Among the varied results which seem to arise, not 
all, unfortunately, are the products of the essen- 
tial double-criterion: 

Is the system commensurate with the specific in- 
formation needs to be dealt with, and is it exten- 
sible? 

Three levels of document description can be 
def i ned : 

1. identification of symbols 

2. identification of their order 
identification of their logic relations 


In this last level, information concepts can 
find expression through logic relations among 
the underlying clue-words. (Figure 6) 

Symbolic Logic Treatment Of Information 

Machines are now linked with the storage, analy- 
sis, and retrieval of information. There is 
little point in attempting to discuss information 
handling without rapid electronic computers, but 
it must be realized that the greatest contri- 
bution of the machine to this field does not lie 
in speed, but in the right type of searching. 

The latter depends on the design of the search 
program which depends, in part, on the method of 
storing the information. From the viewpoint of 
logic, the encoding of the information for stor- 
age and the encod i ng of search requests can be 
regarded as one and the same task. The mechanics 
of storage assignment for the former, and the 
mechanics of matching clues for the latter oper- 
ation, are the only important points in which 
they are different. 

To write a search request means to write a set of 
specifications which should be as precise as 
possible. The burden on the writer of the re- 
quest can be immensely lightened and he can be 
enabled to write a great many such requests in 
the time formerly allotted to one request, by the 
simple expedient of having the machine compose 
the request for him. This can be done through 
symbolic logic, as shown above (Figures 2,4,6). 

All the essential facts known about a document, 
such as the title, serial number, origin, year of 
publication, location in a file, descriptive 
terms, and its subject content, (materials in- 
volved, properties observed, processes performed, 
ambient conditions, applications, etc.) are 
merely listed, regardless of possible redundan- 
cies. Each one of the facts or propositions 
listed may be linked with some of the other prop- 
ositions or with extraneous useful matter con- 
cerning the document sought. There now arise a 
number of statements, such as: a or else b, (Aab) 
a or b or c or a I I three, (VVabc), a implies c but 
not b unless d and e are present, (KCVNdNeCacCKde 
CaKcb) , etc. The machine is directed to run 
through the Truth-Matrix Analysis as discussed 
earlier', and the result is a binary number of 2 n 
bits (for n propositions). This number, called 
the characteristic of the statement, is obtained 
one bit at a time; for every "yes" or I bit, a 
binary number of n bits (for n propositions) de- 
noting the combination of propositional values 
corresponding to the "yes" result, is automati- 
cally recorded in memory. A set of these numbers, 
correspond i ng to a given document, constitutes a 
complete assertion pattern for that document (on 
encoding) or a complete description of the search 
request (on retrieval). A special but frequently 
occurring case is that of d is joi nt characteris- 
tics which is due to a statement in which the 
propositions can be grouped in independent, or 
disjoint, substatements. 


3 . 
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For the example used above, the disjoint charac- 
teristics would be: 

00 10000000000000 

00 II 00 I 100000000 
000000000000 I I I I 

1 I I I I I I 1 00000000 
00000000 1 1 000000 
000000000000 I I I I 
0000001 1 000000 1 I 

The assertion pattern derived from the above 
characteristics would be: 


0010 

0000 

1 100 


0001 

1 101 

0010 

0010 

1 1 10 

001 1 

001 1 

1 1 1 1 

01 10 

0100 


01 1 1 

0101 

01 10 


01 10 

01 1 1 

1 100 

01 1 1 

1 1 10 

1 101 


INI 

1 1 10 

1000 


1 1 1 1 

1001 



This assertion pattern is compared with assertion 
patterns of all entries within a pertinent rang®. 
The pertinent range is defined by combining the 
original statements with a standard set of con- 
clusions, e.g. implications like: water evolution 
on distillation of a solid implies a hydrate, re- 
action with nitrous acid implies the presence of 
a primary or secondary amino-group etc. The re- 
sulting combined character i st i c clearly outlines 
the possible entries sought, and the search pin- 
points it. 

The basic logic "in-and-out" procedure may be 
summarized as follows: 

a . Analyzing and Coding : 

1. Collect facts or propositions 

2. Compose propositions into logic 
statement 

3. Set up proposition-key (dictionary), 
by defining the propositions 

4. Analyze statement and obtain its 
identifying assertion pattern 

5. Store character i st i c and proposition- 
key addresses 

b . Analyzing and Retrieval : 


4. Match assertion patterns 

5. Check proposition-key for print-out 
of meanings. 

The disclosure obtained in this manner is highly 
unambiguous, having been released upon matching 
a assertion pattern which denotes the same logic 
relationships as that of the request. 

The dictionary entries would be kept out of the 
way of the character i st i cs . When part of a doc- 
ument record is devoted to textual material and 
part to a code, the effective search rate is 
severely cut due to the idling of the circuitry 
during the passage of the textual part of the 
record. The search operation should therefore 
be broken into two portions, the first being the 
matching of the logic form or assertion pattern 
of the document and of the proposition code, and 
the second being the special table look-up in the 
proposition-key (Figures 2,4,6). The latter op- 
eration results in a match. On the other hand, 
the proposition code and its dictionary could 
remain related for easy identification of the code 
for a document. 

The amount of detail used in describing concepts 
stored or disclosed in the document file depends 
on the needs of the user and the nature of the 
application. These will also determine whether 
a single-shot search procedure or an iterative 
analysis procedure is to be followed. If the 
file for an area is appreciably less branched in 
its descriptive terms than is the type of request 
addressed to it, the iterative analysis approach 
may be more advisable. No special encoding method 
is necessary in the preparation of iterative anal- 
ysis searches. The form of the request suffices 
to determine the depth of the search and the 
logic classes to be considered during that search, 
or that step of the search. The gradual branch- 
ing of logic decisions is automatically controlled 
by changes in the request statement. 

The iterative variant of symbolic logic searching 
can be employed to determine the degree of rele- 
vance of a disclosure as compared with that of an- 
other. it is necessary only to run several 
searches, each passing through a different logic 
network and to count the matches of certain indi- 
vidual clue-words on the records of various docu- 
ments examined. 

Again, it is the needs and the skill of the user, 
and the nature of the application, that determine 
the extent of automat icity desired in the compo- 
sition of the request sentence. 


1. Collect facts or propositions 

2. Compose request propositions into 
logic statement 

3. Analyze statement and obtain its 
assertion pattern 


A further study of the role to be played by logic 
request sequences in the guiding of the design of 
electronic equipment for searching and selecting 
shou I d be made . 
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Sufficient Clues And Probability Aspects of decimal fractions for probability factors, in 

lieu of 0 and I alone. Dependence must be set 

Besides the need for retrieving information, on the type of probability data obtained. If the 

there is also the necessity to analyze documents relative probabilities were given in terms of the 

for various purposes. It may be desired, for in- reverse relations, i.e. in terms of descriptors 

stance, to examine a collection of data from with respect to documents, rather than the other 

various sources in order to deduce the answer to way round, then some appropriate transformation 

an apparently extraneous question. Or, on re- formula would be used.*' Alternatively, without 

ceiving or recalling various barely related facts, much extra effort, weighted multiple retrieval 

it may be desired to find a special pertinent could be effected, i.e. each of several documents 

document. Such situations will be referred to as could be retrieved with an attached order of 

"sufficient clues" problems. (The word, suffi- desirability. It would then be possible to have 

cient, is used on a hopeful note). the machine judge through a numerical criterion. 

The operator may, on the other hand, prefer to 

Suppose, as a simple example, that these items use his own judgement as to the comparative suit- 

are collected: ability of his results. 


1. There are three authors, x,y,z, each 
having written one of three documents, 
oc, ft , y , not necessarily in this order. 

2. There are three other documents, a,b,c, 
the respective authors of which are the 
same as those of the three above docu- 
ments, respectively. 

3. Document c deals with topic one 

4. Author y writes on a second topic 

5. Document b is not mathematical in nature 


Justification for many of the manipulations of 
probability factors arising in this field is be- 
yond the scope of this paper and may be found in 
the Luce-Raiffa form of utility reference theory 
formulated by von Neumann and Morgenstern. 


x) e.g., if p(a:b|) is the probability 
with respect to b ; , 
and p ( b j : a) , mutatis mutandis, 

p(bj :a)=p(a;bj )p(b P^ a :t> j) p(b j) 


of a 
then 


6. One of documents a,b,c, namely the one 
marked by the corresponding greek letter, 
(designating a paper written by y) deals 
with a third topic. 

7. Author y has written on the topic of 
that one among documents a,b,c which has 
mathematical symbols in it. 

8. Author z does not know document** 

Upon a question such as: Which is the author of 
documented? It can be shown that the straight- 
forward answer, "x is" can be rather suggestively 
obtained by means of the same symbolic logic 
truth-matrix analysis as that used above. It is 
necessary merely to make a propos i t ion-d i ct ionary, 
jot down all the sentences occurring and let the 
machine compose them into an overall statement. 
The character i st i c will give the answer. 

(Figure 7) 

Of course, the actual purpose of using "suffi- 
cient clues" is the correlation of hints and 
guesses in hopes of locating a document fitting a 
very narrow description. 

Many of the decision-making procedures developed 
from symbolic logic principles can be readily 
adapted to the calculation of probability factors, 
instead of bare "yes" and "no" answers. In the 
present application to documentation, this is 
likewise the case. The algebraic equivalents 
through which the logic calculations are pro- 
grammed for the computer permit the introduction 


Some Aspects Of Machine Translation 


In view of the parallel relation between the 
symbolic logic approach to both document encoding 
and information retrieval, it is easy to see the 
further parallel relation between these and ma- 
chine translation. 


The subject of encoding per se has not been dis- 
cussed in the present paper. Suffice it to say 
that, for the construction of a flexible, eco- 
nomical and inclusive proposition-dictionary, 
the Semantic Code discussed by John L. Melton^ 
appears to be exceptionally promising. Other 
approaches to this part of the information prob- 
lem are also being considered. 

Machine translation, (MT) especially translation 
from one natural language to another, has receiv- 
ed a great deal of attention and many results 
have been reported to-date from various parts of 
the world. This subject deserves more than a few 
remarks at the end of the present discussion. As 
a matter of fact, machine translation should be 
treated in a report devoted to the subject of the 
utilization of essential scientific data from 
foreign countries. A brief outline of the con- 
tribution to MT of techniques based on symbolic 
logic analysis will be given here. 

The main tasks in MT are: 


Construction of a formal system (in- 
cluding word lists) for describing 
natural languages. 
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2. Definition and evolution of algorithms 
for transferring (translating) from one 
system to another. 

3. Development of the principles necessary 
for programming and coding these 
algorithms into the 'machine. 

Symbolic logic as a tool enters into all three 
problems, especially the first one. The third 
problem, being the one most closely related to 
the similar problem in information searching 
generally, responds directly to the symbolic 
logic treatment discussed above. In other words, 
the truth-matrix analysis, the binary character- 
istic, the group-theoret i ca I relations among 
character i st i cs , the logic pattern matching tech- 
nique are all applicable to the handling of in- 
formation consisting of linguistic algorithms. 

The second problem is often merged and sometimes 
confused with the first. This happens largely 
because of the difficulty of divorcing semantics 
from the compilation of appropriate word- lists. 

The development of the necessary algorithms is 
basically a logic problem. For the source-target 
language pair, the proper formula structure can 
be set up by symbolic logic. Here, of course, 
quantifiers play an increasingly important role. 
The search for semantic structure is, in fact, 
the prime effort in today's move away from the 
primitive word-by-word look-up idea of MT. Mul- 
tiple meanings of words as well as grammatical 
structure of sentences cannot be taken' into ac- 
count without this search. 

Some successful attempts have been made to attack 
the multiple word meanings by the statistics of 
word-f requenc ies . The grammatical structure of 
sentences is being widely investigated by means 
of morphology (word order and inflection) applied 
to both source and target I anguages . I 2, 1 3 |+. is 

expected that symbolic logic character i st i cs can 
here be used advantageously to differentiate in a 
very simple way between structures. 

By matching the constraints in structure of the 
source language with those of the target language, 
meanings rather than word-messages could be trans- 
lated and furthermore, the storage requ i rements of 
MT can be much reduced in this way. The size of a 
we I I -organ i zed glossary is, in fact, another lead- 
ing question so far left unanswered partly because 
of the uncertainty as to the eventual results of 
structure studies. In the Russian language, for 
instance, 86# of the running words are inflected, 
and the total number of entries required for one 
noun average from 6 to 10, and one verb may re- 
quire 59 entries^. The split-glossary technique 
of Dr. Milos Pacak, of Georgetown University, 
based on morphological analysis, may provide a 
much needed short cut. The classes and sub- 
classes proposed by Dr. Pacak can be controlled 
by logic considerations. 

A system of "Semantic Semaphores" is in prepara- 
tion by the author. This, it is hoped, will aid 


in the establishment of word relationships and 
the preparation of word patterns relating to the 
compilation of thesauri. These, of course, also c 
call for decisions based on familiarity with 
specific technical fields. 

For input to the machine, a satisfactory approach 
may be that of W. R. Nugent ^ or one developed in 
the U. S. Patent Office^. Mr. Nugent uses 
pseudo-alphabets having mnemonic names, which 
can easily all be directed by one single language 
pattern, but permitting a symbol capacity of 500Q. 

Cone I us ion 

Lest one become so deeply involved in theoret- 
ical considerations of a retrieval system as to 
lose sight of its practical requirements, he 
should keep in full view the logical economy with 
which human beings, such as a good research li- 
brarian or a good oral interpreter, work. 

The dexterity of the specialized human mind in 
"homing" rapidly toward the search goal must re- 
main the ultimate aim of the documentation re- 
searcher, until that aim is reached and can be 
surpassed by a machine of machines. 

Too often, an information system is exhorted be- 
cause it has a remarkably large file or an astute 
way of responding to one type of request. For 
that matter, a dove picking out someone's fortune 
from a large pile of envelopes is also remarkable, 
if somewhat one-sided. It is perhaps not so much 
what a human being looks for in searching a li- 
brary, as what he discards along the way that 
should tip off those who wish to direct machines 
to do as well as human beings. 

A student having studied up on a large variety of 
topics for a comprehensive examination, seems to 
expel from his mind, as soon as he sees the ques- 
tions on the examination, all those topics which 
do not pertain to these questions. A trained li- 
brarian or a trained interpreter similarly cuts 
down with bold strokes the search area through a 
few significant decisions which rapidly brings 
him to the eventual choice. Many other people do 
this in other connections. 

What the machine must be told is how to perform 
this "pre I im i nary" narrowing-down process which 
enables it to undertake the search "proper" only 
after the most time-consuming blind-man's buff 
through the logic network has been carried out 
and the irrelevant paths have been discarded. 
Machine searching, similarly to machine trans- 
lation, may be best divided into two stages: the 
rough search (or rough translation), and the fine 
search (or fine translation). No exact theoret- 
ical dividing line can be g’iven. 

In a human being, thi's narrowing-down process 
works by means of two-valued logic, and there- 
fore it must work likewise in the machine. The 
question now arises: Can present-day machines 
or machines now being designed along well-worn 



lines and capable perhaps of expediting the "fine 
search 1 ', be expected to master the real executive- 
robot task of the "rough search"? 

It is suggested that the answer is: No! Tne 
"rough search", the portion of the work that al- 
most demands a chess-player's ubiquity and judg- 
ment coupled with far superior speed, is not like- 
ly to be performed adequately for some time to 
come. In machines, non-ar i thrnet i ca I design con- 
cepts with as yet unrealized searching skills 
will first have to be developed. It rnay be well 
to have a close look at one of the few truly 
progress-or i ented machines now in existence, viz. 
the Searching Selector at Western Reserve 
Un i vers i ty 16, 5 . 

A system of "logic traps", as mentioned above, 
must be achieved which rapidly discards the 
classes of topics that are not wanted by answer- 
ing yes or no to well-ordered questions. It will 
not do merely to make up for the lack of adequate 
binary-logic networks by larger and still larger 
files and by greater machine speeds. 

Often, the aspiration to higher speed obscures 
the purpose of searching. Of course, the highest 
possible speeds are desired and, in fact, machine 
speed is in itself a result of the improvement of 
the logic search paths. Shortcomings in logic, 
however, cannot be covered up by fast "horsework", 
either in the machine or in the paper-and-penc i I 
system which may serve as a program for the ma- 
chine. It is as incongruous to expect good re- 
sults from a hasty approach in this field, as it 
would be for some computer manufacturer to expect 
good results if he felt that not every one of the 
magnetic core elements in his computer's memory 
needs be built carefully just because there are 
so many of them. 

As Dean Shera remarked, " The dim light of the 
electronic tube has led us ever faster along the 
wrong path. For it is not in speed, but in capa- 
bilities that the great promise of automation 
lies..." 

One may note that transistors shed an even dimmer 
light. 
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FIGURE I 


TABLE OF TWO-PROPOS ITIONAL FUNCTORS 


Polish Traditional Character i st i c 


Prefix Notation 

T i e-Symbo 1 

Des i gnat i on 

Mean i ng 

Value 

Na 

a 

non 

not a 

10 

Kab 

a* b 

et 

a and b 

0001 

Aab 

aAb 

aut 

a or else b 

01 10 

Vab 

avb 

ve 1 

a and/or b 

01 1 1 

Pab 

a^b 

Peirce's arrow 

neither a nor b 

1000 

Eab 

a=b 

par 

a 1 i ke b 

100 1 

Cab 

aab 

prae 

if a , then b 

1 101 

Sab 

a/b 

Sheffer's stroke 

not both a and b 

1 1 10 


FIGURE 2 

TYPICAL SIMPLIFIED PROPOSITION KEY 
(kept down to four propositions for each of 7 disjoint statements) 


Physical Description 

a = it is a liquid substance 
b = it is a gaseous substance 
c = it is a crystalline substance 
d = it is a cofored substance 

Melting or Boiling Point 

i = it has a melting point (m.p.) 
j = it has a boiling point (b.p.) 
k = its m.p. is exactly 114 degrees 
I = its b.p. is exactly 222 degrees 

D i st i I I at ion 

q = it distils 

r = it loses water on distillation 
s = it sub I i mes 
t = it tars 


Compos i t i on 

e = it contains N 
f = it contains 0 
g = it contains only C and H 
h = it contains S or P 

So I ub i I i ty 

m = its solubility division is W 
n = aqueous solution acid to litmus 
o = aqueous solution basic to litmus 
p = aqueous solution has pH S 

Reactivity of Substance 

u = distillate reacts with nitrous acid 
v = it produces an acid 
w = it reacts with pheny I hydraz i ne HC£ 
x = it produces an amide 


Reactivity of Derivative I 


y = derivative I has m.p. of degrees 
z = derivative I has b.p. of $ degrees 
a = derivative I is water-soluble 
o = derivative I is i n so I ub i I i ty d i v i s ion 



FIGURE 3 
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. EAnivirLL or_ Li^um-mr i rux 
jj(a'£b) vc]* (a7bvS7cva7c)]v (a*b)* c 
VKVNPabcNVVNSabNSbcNSacKKabc 


a 

i < 

b 

c 

a^b 

a^b 

(a\|fb) vc 
a/b 
a/b 
b/c 
b/c 
a/c 
a/c 
yvwvz 
yvwvz 
u- x 
a- b* c 

(u- xj v(a- b* c ) 


0 0 0 0 1 I I I 

0 0 1 10 0 1 I 

0 I 0 I 0 I 0 I 

1 10 0 0 0 0 0 

0 0 I I I I I I 

0 I I I I I I I 

1 I I I I I 0 0 

0 0 0 0 0 0 1 I 

1 I I 0 I I I 0 

0 0 0 10 0 0 I 

1 I I I I 0 I 0 

0 0 0 0 0 10 I 

0 0 0 I 0 I I I 

1 I I 0 I 0 0 0 

0 1 I 0 I 0 0 0 

0 0 0 0 0 0 0 I 

0 1 I 0 I 0 0 I 


X 


Y 


w 


z 


u 


character i st i c 


FIGURE 5 

QUANTIFYING OPERATORS 



Pref i x 
Symbol 

Name 

Mean i ng 

e.g. 

1 . 

Jtx 

ALL-operator 

for every x 

RxCxy 

2. 

2 x 

EX ISTENCE-opera+or 

for some x 
(at least one) 

£xCxy 

3. 

Ox 

UN IQUENESS-opera+or 

for exactly one x 

OxCxy 

4. 

Mx 

MAX 1 MUM-operator 

for at most one x 
(none or one) 

MxCxy 

5. 

Jx 

CLASS-operator 

the c 1 ass of a 1 1 
x’s such that 

JxCxy 
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FIGURE 4 

TREATMENT OF REQUEST WITH WEIGHTING FACTORS 


1) Proposition-Key: 

a = rubber material for use in 0-ring packing 
b = the packing to be exposed to hydraulic fluid on the inside 

c = the packing to be exposed to atmospheric oxygen on the outside 

d = the hydraulic fluid is an organic phosphate ester 
e = the packing to withstand 225°F hydraulic system temperature 

f = the packing to withstand mechanical abrasion 

2) Weighting Factors: 

the relative importance of b may be .8 

the relative importance of c may be .6 

the relative importance of e may be .5 

the relative importance of f may be .3 

These weighting factors may be re- i nterpreted as probabilities that the given property 
alone fulfil to requirements of the material. 

The entries to be searched will, likewise, be set up with their respective weighting 
factors, thus making possible a preference order among the disclosures. 

3) Symbolic Statement: 

KKKKKabcdef (with b=.8,c=.6,e=.5, f=.3) 

4) Assertion Pattern: 

I , .8, .6, I , .5, .3 


FIGURE 6 

RELATIONS AMONG CLUE-WORDS 

Request : 

Article both in English and concerning aircraft or spacecraft, written neither before 1937 
nor after 1957; should deal with laboratory tests leading to conclusions on an adhesive, 
used to bond metal to one of these: rubber or plastic; the adhesive must not become brittle 
with age, must not absorb plasticizer from the rubber adherent, and have a peel strength of 
20 lbs. /in; it must have at least one of these properties: no appreciable solution in fuel 
and no absorption of solvent 

C I ue-Words : 

English, aircraft, spacecraft, 1937, 1957, laboratory, adhes i ve ; meta I , rubber, plastic, 
brittle, plasticizer, pee I -strength, fuel, solvent 
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FIGURE 6 
(Cont i nued) 


Propos i t i on-Key : 
a = it is an article in English 

b = it is an article concerning aircraft 

c = it is an article concerning spacecraft 

d = it is an article written before 1937 

e = it is an article written after 1957 

f = laboratory tests were run 
g = conclusions on an adhesive were obtained 
The Adhesive: 

h = the adhesive is used to bond metal to rubber 

i =5 the adhesive is used to bond metal to plastic 

j = the adhesive may become brittle with age 

k = the adhesive may absorb plasticizer from the rubber adhesive 

I = the adhesive may have a pee I -strength of 20 lbs./ in. 

m = the adhesive may have appreciable solution in fuel 

n = the adhesive may have absorption of solvent 

Symbolic Statement: 

KKaVbcPdeCf g 
KAhiKKKNjNkISmn 


FIGURE 7 

A "SUFFICIENT CLUES" PROBLEM AND SOLUTION 

x,y,z = documents written by authors x,y,z, respectively 
a, b,c = documents a, b , c, respectively 

p,q,r = the topic of the document is one, two, three, respectively 
m = the topic of the document is mathematical 

From statements I. and 2. of the Problem, one gets 6 sentences of this form: AAxyz, 
all connected by conjunctions, and 18 sentences of this form: CC^xKC^AyzCC^gCgz, 

9 for<v ,jg,J and 9 for a,b,c; all connected by conjunctions 

From statements 3. ,4., 5: Ccp,Cyq,CbNm, 

From statement 6, one gets 3 statements of the form: CCCyotar 
From statement 7: KKCCamayCCbmbyCCcmcy 
From statement 8: OrNz 

The total characteristic of 2 ^ bits simplifies to an assertion pattern of 13 bits which 
indicates the presence of the relationship VNe<x or Gxx, i.e.oi^x 
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THE FACT COMPILER: A SYSTEM FOR THE EXTRACTION, 

STORAGE, AND RETRIEVAL OF INFORMATION 

Charles Kellogg 

Intellectronics Laboratories, Ramo-Wooldridge 
Canoga Park, California 


Summary 

The Fact Compiler is a system for the timely 
extraction of significant information from source 
data and for the storage of this information in an 
organized manner that permits rapid retrieval. 
In addition, the Fact Compiler can process or 
manipulate the stored data in a variety of ways, 
and it is adaptable for use with presend-day re- 
porting techniques. The system is capable of or- 
derly growth to meet the changing requirements of 
growing organizations. 

Information is stored according to a logico- 
linguistic structure. This structure enables the 
system to: (1) directly interrogate personnel and 
request the reporting of specific information; (2) 
automatically present desired data at the appro- 
priate time intervals; and (3) retrieve information 
according to subject, aspect, date , degree of 
specificity, and organizational unit. 

The Fact Compiler 

As society matures, production and use of in- 
formation greatly increases. At the same time, 
individuals tend to narrow their fields of speciali- 
zation in order to cope with this growth of infor- 
mation. Thus, the old adage about knowing more 
and more about less and less becomes more 
meaningful as time goes on. It becomes increas - 
ingly more difficult to assimilate and evaluate the 
mass of data contained in the paper work flooding 
many business, government, and university or- 
ganizations. The techniques of automation have 
been applied in various ways to alleviate this prob- 
lem. This paper will discuss an advanced system, 
the Fact Compiler, which uses new concepts to 
automate the storage and retrieval of information. 

The Fact Compiler is a system for the timely 
extraction of significant information from source 
data and for the storage of this information in an 
organized manner that permits rapid retrieval. It 
has been designed to function as a centralized 
storehouse of important information. In addition 
to retrieving information, the Fact Compiler fa- 
cilitates the processing or manipulation of the 
stored data in a variety of ways not readily possi- 
ble with decentralized storage systems. The sys- 
tem is readily adaptable for use with present re- 
porting techniques used in industry. At the same 
time, it can be used with fully automated input. 
The system is capable of orderly growth to meet 
the changing requirements of growing organiza- 
tions. The Fact Compiler is an in-line processing 
system. That is, input data is immediately re- 
corded in the memory and, therefore, is available 
for immediate use. The rapid response of the 
system permits the prompt discovery of trends in 
the stored data. It also encourages personnel to 
"browse" for information as an aid in finding all 
relevant data. The stored information, even if 


received from widely divergent sources, can be 
organized automatically and used in the genera- 
tion of summary reports. 

A key problem in designing the Fact Compiler 
was the development of a language capable of ex- 
pressing the communication needs of personnel 
engaged in the input of information or in the for- 
mulation of a request for information. A large 
amount of linguistic and empirical study has re- 
sulted in a language structure that satisfactorily 
meets these objectives. Development of the lan- 
guage vocabulary required a precise formulation 
of the information needs of system users. These 
needs are expressed as a series of questions or 
interrogations which are indicative of the kind of 
information that should be stored within the system. 

Interrogations are organized according to a 
logico-linguistic structure and are stored in the 
system memory. Input information consists of 
responses to the appropriate interrogations. The 
combination of an interrogation and its associated 
responses is called a factual statement. A fact is 
defined as an interrogation and one particular as- 
sociated response. 

General system features are shown in Figure 1. 
The system is composed of human and machine ele- 
ments. In general, system personnel exercise 
judgment and perform decision-making functions, 
and the equipment elements perform the necessary 
routine processing. System personnel monitor 
and control operational functions, and aid in the 
input of data. The systemusesa general-purpose 
digital computer and a high-capacity, rapid- 
access, magnetic memory. The greater portion 
of this memory is devoted to the file of facts. 
Other important files are the dictionary, or vocabu- 
lary of terms, and the interrogation vocabulary 
file. The input or extraction process is facilitated 
by the use of a communication console having an 
alphanumeric display tube and a special set of keys 
for data insertion. The output or retrieval proc- 
ess may use either the communication console or 
a high-speed printer. 

Use of a Fact Compiler in a Business 
Firm 

The concepts involved in the use of a Fact 
Compiler may be visualized in the application of 
the system to a hypothetical business firm. The 
organization structure of a typical business firm 
is illustrated in Figure 2. Some of the basic re- 
sponsibilities ofbusiness management are the de- 
termination of the future course of the business by 
farsighted planning, the selection of qualified key 
personnel, development of a sound organization 
plan, and effective control through the delegation 
of responsibility. Management receives the data 
necessary for making decisions in these areas 
from a series of reports generated by line and 
staff departments. A common phenomena among 
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executive personnel is that of being "swamped" 
with reports. Development of procedures to filter 
information destined for management (i. e. , se- 
lection of pertinent information and rejection of 
unnecessary data) is an important function usually 
performed by staff departments. 

With a Fact Compiler, management and staff 
would, in effect, have an "extended memory" with 
which they could review particular aspects of the 
corporation's past history in various levels of de- 
tail. This type of review may permit better fore- 
casts about future conditions such as costs, sale 
prices, raw material availability, and product de- 
mand. Also, the compiler could supply data in- 
dicative of the degree of control in effect within 
the firm. Correlations could be made between 
activities not previously compared and general 
economic conditions. 

Interrogations for a business Fact Compiler 
System are organized into interrogation lists rep- 
resenting major subject categories of interest. 
Typical interrogation list categories might be 
"factory," "regional sales district, " "subsidiary," 
"budget, " "taxes, " "legal, " "patent, " and "cor- 
porate assets." Information to be stored in the 
memory for a particular factory, regional sales 
district, or department is placed in an individual 
record for that factory, district, or department. 
This record is called a Unit Record. Unit Records 
may store information derived with the aid of one 
or more interrogation lists. The use of interro- 
gation lists for data extraction and Unit Records 
for storage and retrieval is fundamental to the op- 
eration of the Compiler System. 

Fact Compiler Language -Restricted 
English 


A solution to the communication problem be- 
tween humans and the computer files was of crit- 
ical importance in the development of the Fact 
Compiler. It was evident that neither natural 
English nor machine language could be used. A 
compromise had to be made. After much experi- 
mentation, a language was developed which has 
proved satisfactory for both personnel and the com- 
puter; the language is called Restricted English. 

Restricted English consists of a vocabulary of 
specially selected English words that are familiar 
and meaningful to personnel as well as being di- 
rectly related to the types of interrogations and 
information requests that the system is designed 
to handle. Words are carefully selected with re- 
spect to meaning, since synonyms are not per- 
mitted. Restricted English terms maybe "sim- 
plex" (single words) or "complex" (a word group- 
ing or phrase). Complex terms are formed to 
represent unique, commonly occurring concepts. 
Rules have been developed to aid in the translation 
process from natural English word groupings to 
Restricted English simplex or complex terms. 

The language consists of five parts of speech. 
Each of these different term categories serves a 
unique function. The parts of speech are: 

Substantive . A noun or noun complex; a name 
for a thing or object. 


Descriptor . Describes or limits definitions 
of substantives . They maybe ordinary adjectives 
or past participles with adjectival functions. 

Activity Connecto r. Relates several substan- 
tives in order to describe an action. They are usual- 
ly present active or present passive participles. 

Relational Connector . Prepositions, used for 
defining relationships between two substantives. 

Interrogative Operator . Interrogative adver- 
bial phrases used to determine the magnitude, 
quality, or position of things or objects of interest. 

Restricted English does not use conjunctions 
or pronouns. Typical examples of business- 
oriented terms are shown in Table I. Most inter- 
rogations may be formulated by using four or five 
terms; however, a few may require as many as 
seven or eight. 

The total set of terms defined for usage com- 
prises the Fact Compiler dictionary. It is esti- 
mated that a dictionary of a few thousand terms 
should be sufficient to handle most business stor- 
age and retrieval applications. Each term in the 
dictionary is coded and assigned a four-digit tag 
for internal computer use. The tag identifies the 
unique term and the part of speech of the term. 

Interrogation File Organization 


Just as the dictionary defines a vocabulary of 
terms for the computer, the interrogation file de- 
fines a vocabulary of interrogations. As mentioned 
earlier, interrogations are filed by major areas of 
interest into interrogation lists. Each interroga- 
tion list is further divided into a five -level generic 
classification structure. 


Levels of Generality 


1 . 

Interrogation List 

2. 

Capital Topic 

3. 

Major 

4. 

Minor 

5. 

Sub 

6. 

Sub Sub 


subject categories 

interrogation 

categories 


By assigning codes to each level, a "generic 
address" is formed which uniquely represents any 
interrogation used in the system. This address 
serves a very important purpose in identifying the 
subject category and level of generality of an in- 
terrogation. It also permits insertion of new in- 
terrogations at the end of a level without requiring 
a revision of the entire address structure as would 
be the case if an absolute address structure were 
used. 

Interrogation file structure is outlinedin Fig- 
ure 3. Each interrogation is represented by its 
generic address, a series of four-digit term codes 
denoting interrogation content, anda series of cri- 
teria. Criteria are defined at the same time as the 
interrogations and permit the computer to perform 
the following operations: 
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1. Present the interrogation automatically 
to an input analyst on the basis of a predetermined 
elapsed time since the interrogation was last 
answered. 

2. Present current factual data to personnel 
at desired periodic time intervals. 

3. Select next interrogation on basis of re- 
sponse to present one. 

4. Determine if an answer is in the proper 
form; if so, perform any processing that may be 
desirable before answer storage. 

Extraction of Informa tion — Input 

The computer may take an active or passive 
part in the input of new information. This de- 
pends on the situation and system requirements. 
With little or no historical data in its memory, 
the computer's role would be basically passive, 
waiting for personnel to request interrogation dis- 
plays and input answers. However, once initial 
data is stored, the Compiler System can compare 
data "age" with interrogation criteria and begin 
actively asking for new input at appropriate peri- 
odic intervals. This last feature can be quite 
useful in assuring the reporting of important facts 
and decreasing the possibility of reporting redun- 
dant facts. 

Upon receipt of a new document, an input an- 
alyst must determine the Unit Record to which it 
applies and then select the major subject category 
involved. This information is conveyed to the sys - 
tern via push-buttons, and the input analyst is pre- 
sented with a set of interrogation subject catego- 
ries on the display scope. A process roughly 
analogous to the game of twenty questions ensues. 
The analyst, now aware of the categories of in- 
terest, reads the document until he finds report- 
able information. Depressing a key onhis console, 
he generates an answer to an interrogation in the 
selected category. On the basis of the analyst's 
response, more detailed interrogations are pre- 
sented as long as affirmative answers are sup- 
plied; negative answers cause the generation of 
higher-level interrogations, different subject cat- 
egories, or result in terminating the interrogation 
generation. 

The extraction of significant information thus 
proceeds, with the analyst alternately scanning 
the document and then reporting data in as much 
detail as possible. This extraction procedure, of 
course, does not depend on the existence of a 
source document. Source data that is in any form 
recognizable by humans may be used. Personnel 
could directly report facts frommemory if desired. 

Fact Storage Organization 

The choice of an information storage file 
structure depends on many considerations such as 
the type of memory device used, estimated size 
of file, retrieval time requirements, and knowl- 
edge of the frequency distribution of various types 
of retrieval requests presented to the file. 

The Fact Compiler memory is capable of 
storing one million factual statements with an av- 
erage of ten answers per statement, for a total of 
ten million separate fact items. With this type of 


memory, it is desirable to store facts so that one 
or several entries to the memory will select the 
information records necessary to satisfy typical 
requests. 

If the frequency distribution of requests were 
known in detail and did not change with time, an 
optimum file structure could be developed. How- 
ever, the distribution estimated before the system 
is in operation is usually only a rough estimate, 
and the distribution will change with time as inter- 
est in various aspects of the stored information 
changes . 

If it is necessary for retrieval time to be 
minimized, factual statements maybe redundantly 
stored under several storage schemes , or storage 
schemes may be revised by the computer as the 
distribution changes with time. 

For most purposes, such a high service rate 
would not be necessary, and one of the two follow- 
ing storage plans would be satisfactory. One or 
the other plan would be chosen on the basis of the 
predominant types of requests. 

Fact St orage by Unit Record. A fact consists 
of an interrogation and the answer to that interro- 
gation, derived from a particular source docu- 
ment. It is also associated with a particular staff 
department, factory, sales district, or other Unit 
Record. A Unit Record storage plan is shown in 
Figure 4. In the Unit Record shown, all informa- 
tion extracted fromfactory XYZ source documents 
is stored. Major columns are the statement ge- 
neric address column and the response columns. 
A response consists of the answer extracted from 
a particular document (represented by its docu- 
ment number) and the date (date on document). 
Rows indicate specific generic addresses of inter- 
rogations used to extract information. An inter- 
section of row and column will provide an answer 
to a particular interrogation referenced to the as- 
sociated document number and date. In actual 
storage, each row is a separate variable -length 
field, and responses are scanned for selection 
purposes. Responses are stored in inverse time 
sequence, so the latest answer is nearest its ge- 
neric address. 

Fac t Storage by Substantive Record . The sec- 
ond storage plan is based on the part of speech 
which stands for the name of the thing or object of 
interest — the substantive. Storage of factual 
statements by the names of things referred to in the 
statements is a powerful method because these 
names or substantives play a dominant role in 
most retrieval requests. 

The layout of a Substantive Record storage 
plan is shown in Figure 5. Each Substantive Rec- 
ord contains the generic addresses of all state- 
ments which contain the particular substantive. 
The addresses are arranged in sequential order as 
in the Unit Record storage plan. However, as 
each address refers to one or more Unit Records, 
code numbers occur as sub-entries undereach ad- 
dress. These represent the Unit Records for which 
each generic address applies. Therefore, generic 
addresses act as main entries and have Unit Rec- 
ord code numbers as sub-entries. Each sub-entry 
has stored with it a set of responses in the same 
manner as the Unit Record. 
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Retrieval 

The concepts and procedures discussed up to 
this point pertain to the extraction of data from 
source documents and the storage of this data in 
an organized manner in a digital file unit. The 
stored data consists of factual statements — each 
statement consisting of an interrogation and a 
variable number of answers to the interrogation. 
Each answer is referenced to the number of the 
document from which it was obtained and the pub- 
lication date of the document. Each statement is 
generically related to other statements in an in- 
terrogation list and is filed in accordance with the 
Unit Record to which it relates. 

The retrieval system is capable of selecting 
the set of statements, the set of answers to these 
statements and, if required, the applicable docu- 
ment numbers that satisfy the conditions in a re- 
quest for information. 

Retrieval requests may take many forms. If, 
for instance, a top executive were interested in 
receiving the most up-to-date information on pro- 
duction schedules or sales forecasts, a console 
could be provided for him with a push-button for 
each category of information. Whenever the ex- 
ecutive pushed one of the buttons, a signal would 
be sent to the computer which would select the 
latest answers to the relevant statements and pre- 
sent this information on a display device in the 
executive's office. For highly standardized re- 
quests, this method would be simple and effective. 
However, if all requests were of this nature, the 
Fact Compiler System would probably not be eco- 
nomically justifiable, as this type of limited, 
highly predictable information retrieval could 
probably be performed by humans at less cost. 

As mentioned previously, the Fact Compiler 
is able to produce routine reports and present 
timely information on a daily basis automatically. 
It is felt, however, that the most important ad- 
vantages of the system are its ability to retrieve 
and relate items of information not ordinarily 
compared or mentioned in normal business re- 
ports, and its capability of reviewing the past his - 
tory of various aspects of business information 
which can be stored in some detail is desired. 
The ability to "probe in depth" into historical busi- 
ness data would seem especially advantageous for 
staff department personnel engaged in analysis of 
data for presentation to top management. 

To provide this kind of capability, communi- 
cation with the Fact Compiler must take place in 
a language powerful enough to express the many 
different and varied requests and yet simple 
enough sothat humans can readily express whatit 
is that they want from the file. An important part 
of the retrieval language is the vocabulary of terms 
used in defining the interrogations themselves. 
Terms are chosen from this vocabulary and are 
combined to express the factual information de- 
sired. A thesaurus will facilitate the correct 
choice of terms. 

Restriction of the request to Interrogation 
List or Unit Record categories is made by includ- 
ing the names of the lists or records of interest. 
Periods of time for answer selection may be in- 
corporated by using date operators such as: 


before-date, after-date, during-month (oryear). 
Grouping within a request will be provided by the 
use of the logical operators AND-OR and appro- 
priate use of punctuation. Certain arithmetical 
operations are allowable for selection and proc- 
essing of answers. Examples are: Answers ^ or 
^ a constant (for selection); sum, average (for 
processing a set of related answers). A retrieval 
request may be visualized as an algorithm which 
is capable of causing the Fact Compiler to select 
and present desired information to the requestor. 
The Fact Compiler retrieval language of English 
terms and operators permits appropriate connec- 
tion of terms, restriction of request scope, and 
definition of processing operations. The combina- 
tion of appropriate terms and operators will allow 
the ready generation of algorithms for the re- 
trieval of information. The retrieval process is 
outlined in Figure 6. 

As an example of a retrieval request, suppose 
it is necessary to have production and sales in- 
formation on all dishwashers produced after June 
1950. It is desired to relate this information to 
changes in the Gross National Product, Index of 
Industrial Production, and dishwasher develop- 
ment cost. The request may be formulated as 
follows: (Dishwasher, model number of, actual- 
forecast sales of monthly, actual-forecast produc- 
tion of monthly, development cost of; Gross Na- 
tional Product; Index of Industrial Production) 
after June 1950. 

1. Parenthesis indicate all answers are to 
meet the date operator requirement. The 
date operator is "after June 1950." 

2. Commas separate related terms. 

3. Dashes separate modifiers. 

4. Semicolons separate unrelated terms. 

The decoding section of the retrieval program 
would break down this request, via the operators 
used and a knowledge of the allowable syntax pat- 
terns in statements, into the following phrases 
which may be partial or complete statements in 
the file. 

1. Model number of dishwasher. 

2. Actual monthly sales of dishwasher. 

3. Forecast monthly sales of dishwasher. 

4. Actual monthly production of dishwasher. 

5. Forecast monthly production of 
dishwasher. 

6. Development cost of dishwasher. 

7. Gross National Product. 

8. Index of Industrial Production. 

The model number and production data may be 
stored in a Factory Unit Record. Development 
cost may be located in a Research and Develop- 
ment Unit Record. Sales information maybe con- 
tained in several Regional Sales District Unit Rec- 
ords. Items 7 and 8 are reportable under a General 
Economic Parameter Unit Record. 

In a file organized by substantives, all infor- 
mation is found under the three Substantive Rec- 
ords "Dishwasher, " "Gross National Product," 
and "Industrial Production." However, in order to 
find the information, the generic addresses of the 
pertinent statements must be known. These ge- 
neric addresses are found through the use of a. fact 



index. This index contains entries for all terms 
in the vocabulary and, for each entry, lists the 
generic addresses of each interrogation which con- 
tains that particular term. 

Consider phrases 1 through 6 above. Each 
phrase consists of three separate terms. All ad- 
dresses of facts that pertain to one of the phrases 
may be obtained by matching the addresses for 
each of the three entries and selecting those that 
are common to all three. When the appropriate 
sets of addresses are selected, the three Sub- 
stantive Records may be directly addressed, the 
generic addresses located, and the date restric- 
tions applied to date -answer combinations. 

It may happen that retrieval requests that 
would be unanswerable may be formulated. This 
would most likely happen when too specific a re- 
quest is made, so that the combination of terms 
used does not occur in any statement in the file. 
A request of this type would immediately be de- 
tected since conjunction of terms in the fact index 
would result in a zero set of generic addresses. 
This fact would be immediately presented to the 
requestor so he could modify his request. 

Conversely, if a request were quite general, 
a tremendous amount of information would be pre- 
sented. Before printing selected information, the 
computer would indicate the approximate volume 
of information that has been selected. 

Monitor and Control 

Figure 1 shows feedback to Monitor and Con- 
trol from the input or extracting process, and the 
output or retrieval process. This feedback is an 
important part of the self-correcting procedure 
necessary to keep system performance at a high 
level of efficiency. 

Feedback from the input process occurs when 
an input analyst finds that he cannot enter what he 
considers significant data into the machine mem- 
ory because there are no appropriate interroga- 
tions in the interrogation vocabulary. 

Feedback from the output process occurs 
when system users are not satisfied with the in- 
formation contained in system-generated reports 
or answers to retrieval requests. 

In either event, operator personnel must de- 
cide whether system capabilities should be changed 
to permit extraction, storage, and retrieval of 
new data. In making such decisions, they must be 
guided by firm management policy on the use of 
the system. 

Systems capability may be modified by mak- 
ing changes in the interrogation vocabulary or 
criteria, the Restricted English vocabulary, or 
the Retrieval Operator vocabulary. Such changes 
are easily made at a special display console op- 
erated only by monitor personnel. 

Additional functions of the group are to trans - 
late natural language requests into Restricted 
English, and to control and operate the system in 
accordance with the wishes of management. 

Conclusion 

The Fact Compiler System represents one 
solution to one category of information storage 
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and retrieval problems . The predominant philos - 
ophy involved in its development has been that of 
an appropriate division of labor between humans 
and machines. Humans are most efficient in mat- 
ters of judgment, machines in routine and precise 
processing. A machine will always need human- 
generated criteria in some form for determining 
what kind of information is significant or important 
since an absolute measure of the importance of in- 
formation does not exist. 

The value of a Fact Compiler System will be 
proportional to the amount of judgment and effort 
exerted in producing and maintaining a set of in- 
terrogations that adequately express the real re- 
quirements of system users. Producing such a 
set of interrogations is not an easy task but it is 
presently being done and is greatly facilitated by 
the use of a series of rules for translation from 
ordinary English to Restricted English Interroga- 
tion Lists. 

A clearly defined unambiguous vocabulary in 
a specialized field of interest permits direct stor- 
age of factual information within a computer mem- 
ory file. This file may be searched in several 
"dimensions" of index space. They are: 

1. time, 

2. subject category, 

3. organizational unit (Unit Record), 

4. aspect (substantives and their modifiers), 
and 

5. degree of specificity (generic level). 

In addition, the Fact File may be used as a 
powerful index to source documents. Requests 
may be formulated in a vocabulary which is easy 
to learn and to use. Generally-oriented or 
"browsing" requests may result in the discovery 
of relations between data never before apparent 
because of departmental report boundaries or be- 
cause of the long time periods involved. 

In many instances, management should be able 
to retrieve facts from the file quicker than they 
can find a report in a desk or file cabinet. Im- 
mediate access to this "extended memory" could 
help management in speeding up the decision- 
making process. 

The Fact Compiler System equipment and 
computer program is capable of handling any in- 
terrogation vocabulary and set of facts that are 
generated in an appropriate form. For example, 
use by a large university might consist of allocat- 
ing system time of several weeks each to medical, 
business, and engineering schools. Thus, the 
medical school might generate a Fact File of in- 
formation about the diagnosis of various diseases, 
the business school might use the Fact Compiler 
to compare data from major industries, and the 
engineering school might use the system to store 
and retrieve data on use and development of digital 
computers. Each school could readmits own vo- 
cabulary and data, and then extract, store, and 
retrieve for the duration of its allocated time. At 
the end of the six-week cycle, the process could 
be repeated. 

The organization of information within the Fact 
Compiler will permit the development of statistical 
procedures for automatic correlation of data within 
the file. This will be one of the objectives of future 
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system studies. It is expected that these studies 
will result in a capability for automatically finding 
cause and effect relationships between various 
types of information in the file. 
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Figure 2 Business Organization Chart 
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Figure k Unit Record Storage Plan 
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Figure 6 Retrieval 
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A WORD-ORIENTED TRANSISTOR DRIVEN 
NON -DESTRUCTIVE READOUT MEMORY 

by 

T. C. Penn and D. G. Fischer 
Central Research Laboratories 
Texas Instruments Incorporated 
Dallas, Texas 


Summary 

Using three -aperture ferrite memory cells, 
a 512-word non-destructive readout memory has 
been operated with transistor drivers. Writing 
and reading are both achieved with bipolar two- 
pulse sequences, which are, respectively, 

ERASE, WRITE and SET, READ. Due to the 
inherently non-destructive manner in which the 
cells are operated, no regeneration cycle is 
necessary to restore information to interrogated 
cells. The necessary bipolar writing and reading 
pulse sequences are conveniently provided by a 
novel magnetic switch matrix, in which all rows 
and columns are driven in coincidence except 
those intersecting at the desired address. The 
resultant driving current at the desired address 
is the sum of all the transistor driver currents, 
or 2(n-l) times the individual transistor driver 
currents for an "n" square matrix. 

Introduction 

Since the invention of the ferrite core 
coincident-current computer memory, there 
have been several systems advanced to utilize 
the best features of the coincident -cur rent 
system while improving upon or circumventing 
its deficiencies. Many of the alternate methods 
utilized a multiaperture device fabricated from 
a square hysteresis -loop type of ferrite as the 
building block for the memory storage or logical 
operations performed by the system. Each 
scheme used the ferrite material's ability to 
store a flux level or previously ordered flux 
geometry without a constant power expenditure. 

The Transfluxor, of Rajchman and Lo*’ 
offered proportional output instead of just ONE- 
ZERO operation, or, if desired, a random-access 
ferrite core memory with non-destructive 
readout. The Hunter and Bauer ^ coincidence 
flux schemes as further improved by Baldwin and 
Rogers^ were able to circumvent the coincident- 
current memory's inherent maximum drive 
current limitations and achieve improved access 
and switching times for a memory system. The 
dc bias offset method as used by Lawrence^ 
overcame the necessity of using square hysteresis- 
loop ferrite material. All of these systems were 
built upon multiaperture ferrite cells and re- 
quired driving currents somewhat larger than the 
coincident -current system. Since the coincident- 


current system in turn required driving currents 
on the order of one-half ampere, it was not 
feasible until recently to utilize solid state 
devices for driving a mamoery system. Con- 
stantine ^ has shown a load sharing matrix 
switch which is suitable for some applications. 

The memory system described here is 
also based upon a multiaperture ferrite cell 
which, superficially, differs little from previous 
such schemes, either in driving currents or 
core geometry. This system, however, embody- 
ing the multiaperture cell and the novel driving 
matrix developed for it, offers the prospect of a 
random-access transistor -driven computer 
memory with non-destructive readout and access 
times comparable to a coincident-current system. 

The multiaperture memory cell operation 
will first be described in detail. The novel 
driving matrix and the memory planes for a 
system using these cells will then be explained 
and finally the entire memory system is des- 
scribed. 

Multiaperture Memory Cell 

The basic memory cell is shown in Figure 
1. Cell geometry is important and the dimen- 
sions of this cell were fixed largely by compro- 
mise between performance, drive requirements, 
and fabrication ease. Cell performance is a 
fairly sensitive function of relative hole sizes 
and spacings (geometry), while drive currents 
vary inversely to the absolute hole diameters. 

The smaller the cells, the more difficult they 
are to fabricate by pressing from unfired ferrite 
powder. The cell chosen was near the minimum 
size which could be manufactured in the labora- 
tory without special production techniques. 

Every memory cell is normally linked by 
three windings, one each for Drive, Inhibit and 
Sense functions. Except for the necessity of 
noise cancellation in a plane of these memory 
cells, the Inhibit and Sense operations could both 
be served by a common winding. 

Writing and Reading are each accomplished 
with a bipolar, two pulse sequence applied to the 
Drive winding, utilizing driving core output in 
both drive and reset directions. This allows a 
maximum simplification of the means connecting 
driving core and memory cell. 
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The pulse sequences used are shown in 
Figure 2. ERASE, READ, WRITE and INHIBIT 
pulses are typically 0. 5 to 1.5 microseconds 
wide, depending upon clock rate used. SET 
pulse is typically 1. 5 times the width of the 
READ pulse. READ and WRITE pulses for 
simplicity are of equal amplitude, typically 1. 5 
to 2. 5 amperes. ERASE and INHIBIT pulses 
are typically 1. 5 to 3. 0 amperes. SET pulse 
magnitude is generally a third to half of the 
ERASE current. Output voltages are typically 1 
to 2 volts, with 0. 1 to 0. 2 microsecond widths. 
ONE/ ZERO ratios are generally 5/1 to 10/1 for 
and individual cell. 

A ONE is written into the cell by applying 
to the Drive winding the pulse sequence shown in 
Figure 2a. 

If the cell was previously unmagnetized or 
was storing a ONE, the cell's flux distribution 
following the ERASE and WRITE pulses are 
shown in Figures 3a and 3b respectively. A 
pulse of either polarity into the Drive winding 
results in a flux around the center hole only, 
regardless of magnitude. Although a current 
through the Drive winding applies a magnetizing 
force around both the small center and large left- 
hand holes, the maximum primary magnetic path 
length around the small hole is less than the 
minimum magnetic path length around the large 
hole. As the current pulse increases from zero, 
the area encircling the small hole is saturated 
before the mmf applied around the large hole 
becomes sufficient to overcome the threshold 
coercivity of the ferrite material. The flux en- 
circling the large hole cannot increase significant- 
ly through this saturated region. 

Considering the above, it is seen that the 
ERASE and WRITE pulses result respectively in 
clockwise and counterclockwise saturation around 
the small hole as shown in Figures 3a and 3b. 

A ZERO is written into the cell by applying 
an INHIBIT pulse in time coincidence with the 
WRITE pulse. As before the preceding ERASE 
pulse would have destroyed any stored information. 
The flux distribution following the INHIBIT or 
WRITE ZERO sequence is shown in Figure 3c. 
Saturation around the outside holes occurs because 
no net current flows through the center hole and 
mmf is applied only around the outside holes. 

This mmf directly opposes the post ERASE satu- 
rated flux around the small hole in the leg common 
to the left-hand hole. The mmf resulting from 
WRITE and INHIBIT pulses in coincidence there- 
fore destroys the flux encircling the small hole 
and allows this flux to be redirected around the 
outside holes. 

Readout is accomplished with the bipolar 
pulse sequence of Figure 2c applied to the Drive 
winding. To avoid extraneous output signals, 


stored information is detected by observing the 
Sense winding output voltage only while the READ 
pulse is applied to the Drive winding. Changes in 
flux linking the Sense winding during ERASE, 
WRITE and SET operations may also induce 
voltages in the Sense winding. These are, how- 
ever, ignored and only those Sense winding out- 
puts occurring in synchronism with the READ 
pulse are relevant to information stored in the 
cell. 

If the interrogated cell stores a ONE, the 
small SET pulse assures that the flux encircling 
the center hole will be in such a direction that 
the following READ pulse will reverse it and in- 
duce an output in the Sense winding. Successive 
READ sequences may repeat this dual reversal 
as often as desired, resulting in non-destructive 
readout of a ONE. 

If the interrogated cell stores a ZERO, the 
small SET pulse is of insufficient magnitude to 
overcome the coercivity of the flux path surround- 
ing the large left-hand hole. The SET pulse 
leaves the ZERO- storing cell arranged such that 
the following READ pulse causes no flux change 
about the center hole and therefore induces no 
output voltage(ZERO readout) in the Sense wind- 
ing. Repeated READ sequences also fail to 
effect flux changes in the cell storing a ZERO, 
allowing unlimited non-destructive readout. 

From the discussion of the READ sequence, 
the governing factors determining SET pulse 
parameters will now be apparent. To obtain as 
fast operation as possible, it is desired to make 
the SET pulse of high amplitude. As the SET and 
ERASE pulses are of the same polarity, there is 
obviously a SET pulse amplitude above which a 
stored ZERO is destroyed. By reduction of the 
SET pulse amplitude below the destructive read- 
out threshold, its volt-time area and hence flux 
reversal capacity has been decreased. The SET 
pulse is therefore increased in width to switch 
the majority of flux encircling the small hole of 
a ONE storing cell. 

The erase mechanism may be clarified by 
the following explanation. If the cell previously 
stored a ONE, the ERASE pulse has the same 
effect as a SET pulse, and the resulting flux con- 
figuration is as shown in Figure 3a. An ERASE 
pulse applied to a cell storing a ZERO first re- 
verses the clockwise flux encircling the left- 
hand hole, since the resulting mmf opposes this 
flux. This mmf also opposes the flux in the out- 
side leg of the right-hand hole. Once the oppos- 
ing ERASE mmf is large enough to destroy the 
fluxes encircling the outside holes, the mmf 
around the center hole is very intense and the 
available flux in the center legs is reoriented 
around the center hole as shown in Figure 3a. 
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The memory cells are normally arranged 
in multicell planes or matrices in which all 
cells are threaded by common Sense and Inhibit 
windings. Inhibiting is done on a planar basis 
and the effects of a random INHIBIT pulse on a 
non-selected cell must be considered. If the 
cell stored a ONE, the random INHIBIT pulse 
simply reverses the flux surrounding the center 
hole, just as a SET pulse would have done, and 
the stored ONE is unharmed. A READ sequence 
then applied to a cell in this condition would pro- 
duce the usual Sense winding ONE output in 
synchronism with the READ pulse. 

A random INHIBIT pulse applied to a cell 
storing a ZERO produces an mmf in the direction 
of the saturated flux around the right-hand hole 
and causes no flux change. It is seen that though 
the random INHIBIT pulse may cause flux changes 
in the unselected cells, no adverse effects occur 
and stored information is not destroyed. 

The flux distributions utilized to explain 
the multiaperture cell's operation have been 
verified experimentally by integration of the 
voltages induced in a winding threading one or 
more holes at a time. By this technique it was 
possible to observe the net flux about any single 
hole or through any of the various legs of the 
cell. Though the flux distributions thus obtained 
may undoubtedly be explained in other terms, 
such as those employing crescent- shaped flux 
patterns, the relatively simple explanation pre- 
sented here, supported by ample experimental 
evidence, is felt to be perfectly valid. 

Complement Switch Matrix 

The bipolar pulses required for the pulse 
program of the previously described multiaper- 
ture memory cell are conveniently supplied by a 
switch matrix of tape wound cores. To achieve 
the drive currents desired for high speed 
operation and still use transistors as the pulse 
generators, a non-conventional circuit is used. 

No dc reset current is used, allowing all of the 
transistor current flowing to be used for switch- 
ing. In the complement switch matrix to be 
described, the complement of the row and column 
address is excited, i. e. all rows and columns 
except those intersecting the desired core. For 
example, a 3x3 array is shown in Figure 4 with 
appropriate drive windings. For simplicity no 
reset or word drive windings are shown. All 
cores are initially mangetized clockwise. If it 
is desired to select core 9, all rows and columns 
other than row 3 and column 3 are excited at an 
amplitude I. The excitation received by cores 
1,2, 4 and 5 is equal to the algebraic sum of the 
magnetomotive forces due to the 4 turn row, 4 
turn column and single turn cancel windings. 

For these cores that is 4I+4I-4I=+4I, clockwise 


magnetization assumed positive. Thus none of 
these cores are switched irreversibly. Cores 
3, 6 , 7 and 8 are excited either by a row or 
column winding (not both) and the single turn 
cancel winding. Their excitation is therefore 
41-41=0. Selected core 9 is driven exclusively 
by the single turn cancel winding. Core 9 is thus 
excited by -41 which causes it to switch irre- 
versibly. The effect of this wiring scheme is to 
permit transistor currents to parallel into a 
single turn through the selected core. 

In a practical system a reset winding is 
wired to produce clockwise magnetization in 
every core. In the system to be described, two 
identical complement switch matrices are used. 
The cancel winding of one matrix is extended 
and threaded in reset fashion through the other 
matrix. The number of turns required for the 
row or column windings of an "n" square matrix 
is 2(n-l). Individual word drive lines are 
associated with each switch core on a core per 
word basis as shown in Figure 5. 

The transistor circuitry associated with 
the complement switch matrix drivers is shown 
in Figure 6 . Current routing, current mode 
switching and grounded base switching are used 
to achieve speed from economical transistors. 
When either T 3 or T 4 is driven by the excite 
pulse in the absence of a negate pulse on D, the 
emitter of is driven positive with respect to 
its base and conduction takes place. If a 16x16 
complement switch is used, 29 transistors 
associated with the other rows and columns are 
driven in like manner. One row and colum will 
be negated for address purposes. On the appli- 
cation of an excitation pulse simultaneously with 
a negate pulse, diode D conducts turning off Tj 
which drives the base of more positive than 
its emitter. T 2 therefore will not conduct these 
conditions. The voltage on the common bus is 
low due to the conduction of thirty transistors in 
parallel compared with the voltage developed a- 
cross the switched core. Independent supply 
voltages Ei and E 2 allow control of the pulse 
amplitudes and durations for two separate excite 
pulses later to be called ERASE and SET. 

Though this circuit appears deceptively simple, 
many subtleties caused other circuits to be re- 
jected in favor of this simple one. Enumeration 
of some of the more important considerations 
will be discussed during system description. 

Memory Cell Matrix 

Figure 7 illustrates a 3x3 memory plane 
wiring scheme for non-destructive readout of 
multiaperture cells showing planar Inhibit and 
Sense windings. Noise cancellation is used as 
in conventional memory planes, necessitating a 
separate inhibit line. 
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Memory System 

Figure 8 shows the block diagram of a test 
vehicle embodying previously described circuits. 

In operation one switch matrix is operated in a 
"selective mode", i. e. it selects the address of 
the word to be written or read. The other switch 
matrix is operated in a "non- selective" mode, to 
reset the selective mode switch matrix. Either 
of the switch matrices may be used for either 
mode, but for simplicity SM- 1 will be described 
in the complement selective mode and SM-2 will 
be chosen for non-selective mode reset. Thus 
SM-2 will be driven in such a manner that no 
cores in SM-2 switch by driving all rows and 
columns; but SM-2's current will switch the 
previously selected core in SM-1 back to the 
reset position. 

The program shown in Figure 2 will be 
used for a four pulse sequence of ERASE, WRITE, 
SET and READ. SM-1 driven selectively gener- 
ates a large ERASE pulse which clears all 
memory cells on the addressed word line. The 
polarity of the ERASE pulse is opposite to the 
READ and WRITE pulses. This operation is 
followed by driving SM-2 non- selectively which 
resets the addressed core of SM-1. A WRITE 
pulse is generated by this operation which is of 
sufficient magnitude to write a ONE in all ad- 
dressed cells. Those planes in which a ZERO 
rather than a ONE is desired are energized simul- 
taneously with an INHIBIT pulse applied to the 
memory plane inhibit winding. Both SM-1 and 
SM-2 are now in the normal state, that is all 
cores switched to the same sense. Clearly this 
operation may be followed either by another Write 
sequence or, as will be shown, by a Read sequence. 

The read operation consists of two phases; 
the SET and READ. SM-1 is driven selectively 
at an amplitude which will not clear cells in the 
ZERO state and for a duration sufficient to 
switch all the cell flux desired. Following the 
SET pulse, a READ pulse is provided by driving 
SM-2 non-selectively to reset SM-1. Simultane- 
ously a strobe gate energizes the memory plane 
sense amplifiers. If a ONE is present in a plane, 
sufficient output voltage will be generated to 
excite the sense amplifier. If a ZERO is present, 
the noise voltage is of insufficient amplitude to 
energize the sense amplifier. Strobing is neces- 
sary to prevent spurious outputs during the other 
operations. 

Governing factors pertaining to the pre- 
viously described row and column switching cir- 
cuit may now be considered. It had first been 
thought that adequate speed and current could be 
obtained by using a dc source and driving row and 
column gates in parallel to excite the switch 
matrix. The required independent control of 
duration and amplitude for the ERASE and SET 


pulses, however, make this method very diffi- 
cult to implement. Another fault of such a 
scheme is the loading effect caused by the tran- 
sistor drivers during reset operation. These 
loading effects may be due to one or all three of 
the following: 

1. Minority carrier storage 

2. Output capacitance of the driver 
circuits 

3. Conduction of the driver due to polarity 
of the induced voltage (such as emitter 
followers). 

In the configuration described, larger output 
currents result at speeds not possible in the dc 
collector supply type of operation. Further, the 
polarity of the induced voltage during reset aids 
turn off by negatively biasing the emitter of T£ 
in Figure 3. The adjustment of SET pulse ampli- 
tude may be performed in a single stage by a 
simple change of dc level. Negation of a row and 
column is independent of the applied ERASE and 
SET pulse amplitudes and durations, greatly 
simplifying the system. 

In the test vehicle shown, although random 
access is allowable, the test program used pro- 
vides sequential addresses via a shift register 
ring for negating one row and column at a time. 
This greatly aided trouble shooting during 
"de -bugging". 

The writers are indebted to W. A. Kluck 
whose basic ideas started this investigation and 
to W. T. McKay, Jr. for his circuit contribu- 
tions. 
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Figure 1. Dimensions and windings for a 
typical multiaperture memory cell. Typical 
thickness is about 0. 045 inches. 
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Figure 2a. Bipolar cur- 
rent pulse sequence ap- 
plied to Drive winding for 
storing either a ONE or 
a ZERO. 
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Figure 2b. Current 
pulse applied to Inhibit 
winding for inhibiting or 
writing a ZERO. 


Figure 2c. Bipolar READ 
sequence for interrogating 
multiaperture cell. 


Figure 2d. Typical ONE 
readout voltage from 
interrogated cell, show- 
ing coincidence of read- 
out and READ pulses. 



Figure 3a. Flux distribution resulting from 
application of ERASE pulse to a cell storing a 


ONE. 



Figure 3b. Flux distribution resulting from 
WRITE pulse. The cell is now storing a ONE. 



INHIBIT or WRITE ONE sequences. The cell 
is now storing a ZERO. 
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Figure 5. Driving core and word line scheme 
used to drive multiaperture memory cells. 


Figure 4a. Complement switch matrix. 

+ 41 +41 0 

+ 41 +41 0 

0 0 -41 

+ = Clockwise Initial 
Magnetization 
- = Counterclockwise 
Switching Direction 



Figure 6. Transistor circuitry for driving 
switch matrix. 


Figure 4b. MMF chart. 
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INHIBIT 



Figure 7. Typical memory cell matrix with 
planar Inhibit and Readout or Sense windings. 
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Figure 8. Test vehicle for system utilizing 
multiaperture memory cells. 
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Introduction 


The Unifluxor is a new binary permanent memo- 
ry element which appears to have the advantages of 
high-speed operation, easy fabrication, and low 
cost. Unlike cores, twistors, capacitors, and 
other commonly used memory devices, the Unifluxor 
does not depend upon the hysteretic properites of 
some nonlinear material but instead uses the in- 
ductive characteristics of magnetically coupled 
wires. 

A Unifluxor memory array consists of a 
printed-circuit board upon which are etched longi- 
tudinal drive lines and transverse sense lines. 

Each intersection of a drive and sense line repre- 
sents one bit, with the bits of one word lying a- 
long the same drive line. Thus the array operates 
in the word-organized mode. 

The particular state of each bit depends upon 
the presence ("one’’) or absence ("zero") of a cop- 
per slug. The copper slugs, properly spaced and 
oriented, are contained on a plastic film cemented 
in place over the printed-circuit board (Figure 1). 
In order to change the contents of the memory, it 
is necessary to substitute a new cover film with 
the desired pattern of copper slugs. 

Theory of Operation 

"Zero" State 

Consider a wire in the plane of and perpen- 
dicular to a closed wire loop (Figure 2). When 
the switch is closed and current flows in the wire, 
a cylindrical magnetostatic field is created 
around the wire, the effective boundary of which 
is indicated in the diagram by dotted lines. The 
total flux within the loop then is 0^ + 02- 

At the time the switch is closed and the 
field is building up, an emf (E) is induced in the 
loop according to Faraday's equation: 

E = -Nd0 t /dt 

= -N(d0i/dt + d0 2 /dt ) (1) 

Since 0 is equal to the product of the flux 
density (B) and flux area (A), equation 1 may be 
rewritten as : 

E = -N [d(B 1 A 1 )/dt + d(B 2 A 2 )/dt] (2) 

For the case illustrated, the following equalities 
exist : 

B 1 = -B 2 
A 1 = a 2 


Thus, the emf’s generated in the two halves 
of the loop are equal in magnitude but opposite in 
sign, and the net output is zero. This in fact is 
the condition for a Unifluxor in the "zero" state 
where the loop is the sense line, the wire the 
drive line, and the switch the logic which selects 
the particular drive line. 

"One" State 

Consider a second loop-and-wire arrangement 
similar to the first (Figure 3). Here, however, a 
copper slug has been placed in the path of 02. At 
the time the switch is closed and the field is 
building up, Eddy currents and other losses (de- 
pending upon the configuration) are induced in the 
slug. The effect of the losses is to reduce B 2 
such that: 

B»2 = B 2 -AB2 

If the new value B' 2 is substituted in equation 2, 
it becomes evident that a net emf proportional to 
AB 2 will be induced in the loop and a signal will 
appear at the output transformer. A similar sig- 
nal but opposite in sign will appear when the 
switch is opened and the field collapses. This is 
the condition for a Unifluxor in the "one" state. 

Design Considerations 

The amplitude of the "one" signal depends 

upon : 

1. The rate of change of the current in the 
drive line; 

2. The final amplitude of the current; 

3. The spacing between the two legs of the 
sense line loop, and 

4. The amount of flux imbalance introduced 
by the copper slug. 

Each of these parameters influences the design of 
an operating memory. General considerations are 
discussed below, followed by values of the para- 
meters chosen for a working model of the Unifluxor 
memory . 

The need for a rapid build-up of a current 
limits the length of the drive line, since as the 
line is made longer its inductance becomes larger 
and hence objectionable. The desired cycle time 
also influences to a Trailer degree the maximum 
allowable length. 
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The final amplitude of the current determines 
the effective boundary of the flux of the drive 
line, that is, the point at which B(i max )is essen- 
tially zero. The drive lines adjacent to the se- 
lected line must fall outside this boundary; 
otherwise noise will occur whenever the drive line 
on one side of the selected line has a slug and 
the line on the other side does not. The ampli- 
tude of the current thus determines the minimum 
spacing between drive lines. 

The position of the effective boundary of the 
drive line field is one factor in determining the 
flux area; the second factor is the spacing be- 
tween the legs of the sense line loop. In order 
to obtain an area of sufficient size and conse- 
quently a "one" signal of reasonable strength, a 
compromise must be reached between current ampli- 
tude and the spacing of the loop. For a very 
small current, if the drive lines are close to- 
gether, the sense line legs must be widely spaced, 
thereby keeping the over-all size of the array 
relatively constant. 

The amount of flux imbalance caused by the 
slug is determined by its position, size, and 
shape. Theoretically, an open slug which com- 
pletely cancelled the flux in area A2 would cause 
a maximum A^, the decrease resulting not only 

from Eddy current losses but also from the field 
of the counter-emf that would be induced in the 
loop. In practice, a solid rectangular slug is 
sufficient, yielding a strong "one" signal with 
excellent signal-to-noise ratio. 

Working Model 
Physical Arrangement 

A working model of the Unifluxor memory has 
been constructed for experimental purposes. The 
memory has a capacity of 64 words of 50 bits each. 
The 64 words are contained on two arrays of 32 
words each. The drive lines of the two arrays are 
shared in common; the sense lines are independent. 
The configuration of the memory element used in 
the model is shown in Figure 4, which is an en- 
larged diagram of several bits. 

Note that the sense line crosses the drive 
line at an angle. The adjacent drive lines are 
crossed at the same angle but in the opposite 
direction. Skewing the sense lines in this way 
serves two purposes: It permits the slug to be so 
placed as to interrupt B2 in two areas, thus 
achieving more imbalance, and in addition it tends 
to cancel any noise originating from nonselected 
drive lines. 

To read any given word requires full current 
on the selected drive line and no current in the 
other drive lines. The selection must be near per- 
fect, else each leakage current will cause small 
outputs from the "one" bits through which it passes. 
This noise, multiplied by the number of drive lines 
linked by the same sense line, can reduce the sig- 
nal-to-noise ratio and mask the selected bit. 


One way of obtaining perfect selection is to 
use a separate driver for each bit. The economic 
obstacle this method imposes, however, obviates 
its use in a large memory. For the laboratory 
model a switch-core selection system was chosen 
which involves four drivers, eight gates, and 32 
switch cores. Each switch core selects two drive 
lines, one on each of the independent arrays. 

The outputs from the two corresponding sense 
lines appear at gates at the input to the sense 
amplifier. The gates permit the output from only 
one sense line to pass at any one time. 

Two different sense amplifiers have been de- 
signed. One amplifier produces an output when- 
ever any negative-going pulse appears at its in- 
put. No output is produced for positive-going 
inputs. The second amplifier produces an output 
only if the leading pulse is negative, with no 
output under other conditions. Both amplifiers 
have extremely fast recovery times. 

A full discussion of the address logic is 
omitted from this paper, since the logic is similar 
to many other addressing schemes for word- organized 
memories. However, a novel method of selecting the 
drive cores has been devised, which reduces the 
noise level far below that usually experienced. 

The principle is illustrated in Figure 5. In 
quiescent state, the address translator holds the 
driver and gates cut off. (The full schematic of 
only one gate is shown, the circuits of the others 
being identical.) No current flows through the 
core winding associated with gate 1 because of the 
blocking action of the diode. So also for the 
other two core windings: No current flows. 

Assume that the driver and gate 1 are now 
simultaneously enabled, as indicated by the small 
waveforms on the diagram. A low-resistance path 
from ground is created through Ql, through the 
winding of the left-hand core, through Q2 to the 
positive side of the supply. The magnitude of the 
current through the winding is sufficient to select 
the core, and an output pulse appears on drive 
line 1. No current flows in the nonselected core 
windings because gates 2 and 3 remain closed. 

Although it would appear possible theoreti- 
cally to eliminate the drive core and let Ql 
drive the drive lines directly, in practice this 
causes ringing on the drive line. 

Electrical Characteristics 

A 500-mil liampere pulse 150 millimicroseconds 
wide is used as the drive current. The flux 
created by the current at its maximum has a density 
of approximately one gauss at a distance one milli- 
meter from the drive wire, as calculated from the 
equation : 

B(gauss) — 1 10^/27rR)0 

This equation considers the drive line as 
infinitely long, an assumption justified by the 
ratio of its actual length to the diameter of the 
of the effective field. The value of 4 may be 
taken as that for free space (4TT10 r '7). 9 is a 

unit vector normal to the drive line. 



The net emf induced in the sense line by a 
"one" bit has a range of 8 to 12 millivolts. The 
corresponding output from the sense amplifier is a 
rise from -4 volts to 0 volts. Figure 6 shows the 
sense amplifier outputs for both "one" and "zero". 
The signal-to-noise ratio is greater than 15 to 1. 

At present, the memory is being operated at 
interrogation intervals of 420 millimicroseconds. 
An interval of 350 millimicroseconds appears possi- 
ble with the present design. The maximum possible 
interrogation rate of the memory element itself is 
not known, the recovery time of the sense ampli- 
fiers and the speed of the address logic being the 
limiting factors in the present design. No heating 
or other adverse effects have been noted from re- 
peated interrogation of the same drive line. 

Fabrication 

The drive lines are etched on an epoxy glass 
substrate and made flush with the surface by 
heating the substrate to 375°F and placing it in a 
50-ton press. A Mylar film with a copper sheet 
bonded to it is cemented in place on the substrate, 
and the sense lines are etched from the copper 
sheet. 

The copper slugs are etched on a second Mylar 
film, with a slug in each bit position. When the 
contents of the memory are determined, the slugs in 
the "zero" positions are punched out and the film 
cemented to the sense and drive line array. 

(For customer use, thin stiff cards appear 
more desirable as a base for the slugs. A mechani- 
cal arrangement which holds the cards firmly in 
position against the control lines without cement 
has been developed, and the cards may easily be 
exchanged. A device to prepare cards with the de- 
sired slug arrangement semiautomatically is now 
being investigated. Using standard punched paper 
tape as input, the device activates a bank of lamps, 
exposing photo-resist material on the previously 
sensitized card. The card is ready for use after 
development and cleaning.) 


3.2 

None of the fabrication processes are critical 
or expensive, nor do any require highly skilled 
operators. In the model built for use in the labo- 
ratory, a density of approximately 200 bits per 
cubic inch was achieved with little effort. The 
array may further be scaled down by decreasing the 
distance between drive lines and the width and 
separation of the sense line loops. (In the labo- 
ratory model, these dimensions were made conserva- 
tively large. ) 


Conclusions 

The Unifluxor is particularly suited for in- 
struction storage in such fixed-program machines 
as process control and missile guidance computers. 
The speed of the element is sufficiently high as 
to adapt it for use in next-generation machines. 
Although life tests to prove its reliability have 
not yet been performed, the element itself does not 
appear to have any characteristic that would 
change either with time or operation, and hence 
the reliability of the memory control circuits 
would be the governing factor. 

Another potential application now being in- 
vestigated is dictionary storage for machine trans- 
lation. The low cost of the Unifluxor may make 
possible random-access memories of a size hitherto 
unfeasible economically. If the memory element 
itself was the only aspect of this problem, the 
Unifluxor would solve it; the cost of the control 
circuits, however, remains and must first be re- 
duced. 
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Figure 1. Simplified Diagram of Unifluxor Memory 



Figure 2. "Zero" Configuration 
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Figure 3. "One" Configuration 



Figure 4. Enlarged Section of Unifluxor Array 
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DRIVE LINE I DRIVE LINE 2 DRIVE LINE 3 



Figure 5. Core Selection 



PATTERN 1011 IS THE RESULT OF 
INTERROGATING FOUR SEPARATE 
DRIVE LINES IN SUCCESSION. 


Figure 6. Sense Amplifier Output 
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CHARACTERISTICS OP A MULTIPLE MAGNETIC PLANE THIN FIIM MEMORY DEVICE 


Kent D. Broadbent 
Hughes Research Laboratories 
Culver City, California 


Summary 

In a previous paper, 1 a ferromagnetic film 
memory was described having a complex structure of 
evaporated magnetic, dielectric and conducting 
materials. The multiple plane geometry was design- 
ed to enhance the characteristics which are desir- 
able in practical digital computer memories without 
requiring extreme uniformity and control of the 
magnetic properties. 

This paper describes the electrical char- 
acteristics of prototype memory elements, and 
discusses their application in computer memory 
systems. A short review of the basic magnetic 
structure and mode of operation is given first 
to provide a common reference for discussion. 

Magnetic Structure and Mode of Operation 

The structure to accomplish the selection 
and storage functions within the matrix is com- 
prised of four ferromagnetic films appropriately 
interlaced with evaporated drive and read-out 
conductors, and insulators. All nineteen layers 
(Figure l) are vacuum deposited. Figure 2 repre- 
sents schematically the reference (binary l) state 
of the four magnetic layers. The dimensional 
ratios are shown highly exaggerated for clarity. 

The four-layer magnetic complex is in a high 
or low magnetostatic energy state depending on 
the sense of M vectors within the films. The low 
energy, statically stable states are those in which 
two M vectors point in a direction opposite to the 
remaining- two. Any other configuration results in 
an unstable high energy state which quickly decays 
to a stable condition. The difference in film 
widths causes this decay to proceed by a path 
involving least expenditure of energy. Due to 
the difference in the volumes of magnetic material, 
this minimum-energy transition occurs by reversal 
of the uppermost film available for reversal. 

This is illustrated in Figure 8. In going from 
the "l r " state to either of the "l<q" states, the 
uppermost film available for reversal is the 
third layer. The fourth (read-out) film thus 
remains in the original direction and no voltage 
is induced in the sense conductors. It is seen 
that any subsequent pulses applied to either X or 
Y merely affects the first two layers; only co- 
incident drives on X and Y can establish either 
the "0 r " or "l r " states. 


Sei Shohara and George Wolfe, Jr. 
Hughes Ground Systems Group 
Fullerton, California 


Electrical Characteristics 

Of primary interest in memory applications 
are the switching characteristics and the selec- 
tion ratio of the basic memory cell. Also, in 
the case of evaporated memories, RLC parameters 
are of added importance because of the extremely 
small dimensions attainable. These are now given 
for the prototype elements. 

Resistances of the conductors of prototype 
elements are centered around two ohms per element. 
Inductances are in the millimicrohenry range, and 
capacitances between conductors range from 80 up 
to 200 micromicrofarads per element. The magnitudes 
of resistance and capacitance pose some problems 
in designing an optimum practical array, as dis- 
cussed later. 

To obtain the switching characteristics, 
pulse sources of variable amplitude and polarity 
are applied to the X and Y drive conductors connect- 
ed in series, as shown in Figure 3. The peak out- 
put voltage and switching time (measured between 
10 percent points) is plotted against current 
pulse amplitude in Figure 4. The "disturbed 0" 
outputs normally plotted on graphs of this type 
are not shown here since it could not be measured 
with the existing equipment. This is discussed 
further under selection ratio tests. The rise 
time of the driving pulse is 20 mps throughout 
the entire range of currents. The leveling-off 
effect of switching time at high currents can be 
ascribed partly to the finite current rise time. 

This was checked by using a mercury relay pulser 
with 1 mps rise time and a traveling wave scope. 

At a maximum current of 3 amperes, the limit of 
the pulser, the switching time observed was 10 mps 
shorter than shown in the plot of Figure 4. The 
switching times below 1,2 amperes are identical 
with the values of Figure 4, showing that switch- 
ing time is amplitude limited below 1.2 amperes 
for 20 mps current rise time. The relay pulser 
was not used to plot Figure 4 for the reason 
that a high-current transistor driver capable 
of rise times below 20 mps do not appear practical 
at this time. On the other hand, rise times 
significantly greater than 20 mps would begin to 
hinder the inherent speed capacity of the device. 

The figure of 20 mps was thus chosen as the best 
compromise. 

In the low current region, switching can he 
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detected at currents as low as 160 ma. if suf- 
ficient pulse width (0.6 ps) is used. 

To determine the selection ratio, the circuit 
of Figure 3 is slightly modified so that the X 
and Y windings can be driven separately by adding 
another pair of pulse generators. The X winding 
is driven by 3 amperes (limit of equipment) 
rising in 20 mps in the READ direction and 500 ma. 
in the WRITE (reset) direction. The Y winding 
is driven with 500 ma. in both directions. The 
photograph in Figure 5 shows superimposed traces 
of the output voltages, all during READ time, 
under three different conditions: 

1. Only X is driven in the READ direction 
followed by a coincident WRITE drive. 

2. WRITE drives are removed. 

3. Both X and Y are driven coincident ly 
in the READ direction followed by 
coincident WRITE drives. 

The output voltage resulting from the third 
condition represents a full select ”1” and is 
shown for reference. The second output is the 
"zero" signal reference since no writing occurs 
prior to READ, and the output for condition 1 
shows the effect of a 3-ampere half-select pulse. 
The difference in output voltages between con- 
ditions 1 and 2 is due to the presence of Y drive 
during condition 2 but not during condition 1. 

The ringing is due to the combined effect of 
element capacities and the access leads from the 
prototype measuring jig setup. It is seen that 
the memory element is not disturbed by half-select 
currents which are greater than an order of mag- 
nitude times the current amplitude sufficient to 
initiate switching (l60 ma.) when applied co- 
incidently. Exactly the same results are obtained 
when the Y winding is half-selected although its 
associated magnetic plane occupies a position 
closer to the output magnetic film. 

Besides the switching characteristics and 
selection ratio, another parameter of interest, 
especially in fast memory systems, is the delay 
of the output signal in propagating over the 
sense conductors. The problem is accentuated 
in the thin film element because the extremely 
small dimensions result in larger values of 
capacity (as well as resistance) than are 
encountered in ordinary core systems. The effect 
of these were measured as shown in Figure 6 by 
introducing a 5 millimicrosecond pulse in a 
sense winding passing through several elements. 

Two traces are photographically superimposed on 
a common baseline, one with only helix A connected 
and the other with only helix B connected. The 
time displacement between the two waveforms con- 
stitute the total delay. Any possibility of 
inequality in electrical line lengths is can- 
celled out by repeating the measurements with 
A and B helices interchanged. The average delay 
per element was observed to be about 0.3 mps. 

This is somewhat larger than is desirable for 
a memory of large size, so that the elements 
currently under development have reduced 


capacitance, with some sacrifice in output volt- 
age amplitude, resulting in reduction of delay 
by an order of magnitude. 

Memory Organization 

The individual memory cells and interconnect- 
ing conductors of prototype matrices are evaporated 
in a manner such that the operation of the memory 
is similar to the conventional coincident current 
memory plane. That is, the coincidence of a X 
and Y drive current select a memory cell. One 
important difference is that the X and Y drive 
current amplitude can exceed the switching thres- 
hold by a factor of 20 or more without affecting 
the information state of a memory cell. This is 
because of the virtual magnetic decoupling between 
the X drive layer or Y drive layer from the sense 
layer when either the X or the Y drive line are 
selected individually. The switching speed of 
the selected memory cell is then a function of 
the drive current. Switching time in the order 
of 40 to 80 millimicroseconds are obtained with 
relatively low amplitude drive currents. We now 
consider some of the factors affecting the design 
of arrays. 

Figure 7 illustrates a possible arrangement 
of memory elements and planes. Sense lines are 
not shown, for clarity. Assume that suitable 
bipolar drivers are tied to the X, Y access lines, 
providing READ and WRITE current pulses. It can 
now be shown, with the aid of Figure 8, that any 
sequence of non- coincident drive currents cannot 
alter the stored information although the M-vector 
configuration can enter different statically- 
stable magnetic states. In Figure 8, the l r and 
0 r states are taken as the initial states, and 
all possible configurations arising from subsequent 
non-coincident drives are shown for both cases. 

It is first noted that it is not possible to go 
from the l r state to the 0 r state or vice versa 
with any sequence of disturb pulses. Second, 
once a Id or Od state is entered, it is not pos- 
sible to return to the reference states. In all 
cases the fourth layer is magnetically decoupled 
from the driven layers. Further, once the dis- 
turbed states are entered, the two bottom layers 
are not affected by further disturb pulses. This 
"decoupling" feature implies a somewhat more sub- 
tle writing technique than core memories using 
inhibit currents for digit control. 

Memory organizations based on biasing a 
winding or adding (or modifying) a conductor 
(to obtain economy in the driver count) are 
presently under evaluation. One important 
consideration in this evaluation is developing 
an optimum method of interconnecting the elements 
to obtain cancellation of capacitive ly coupled 
noise, analogous to the use of diagonal sense 
windings in core arrays to cancel inductively 
induced noise. Another consideration is the 
effect of the relatively high resistivity of 
the conductors. If IR drops are traced in 
Figure 7, it can be seen that care must be 
exercised to insure that the voltage drops in 



99 

3.3 


access lines do not cause excessive voltage gra- 
dients across the extremely thin insulating 
layers. 

In applications involving large numbers of 
cells a reduced cell length and reduced magnetic 
film thickness is used. One of the virtues of 
the cell is that, where desirable, thicker films 
and longer lengths may be employed to achieve 
substantial outputs, such as with the .200" cell 
in obtaining all the previous data. However, in 
large systems the problems of power dissipation 
per cell, back e.m.f. per cell, output signal 
delay per cell, and maximum storage density per 
surface area dictate the use of a smaller cell. 
Smaller size is feasible because of the vacuum 
techniques employed, and the use of a reacting 
system such as this, where flux may be switched 
at any location, becomes practical due to the 
minute amounts of magnetic material that can be 
employed using vacuum, vapor-phase processes. 


Conclusion 

The multi-layer memory elements offer con- 
siderable promise in application to high speed 
memory systems. A number of problems remain to 
be solved in designing peripheral circuitry 
compatible with the speed capabilities of the 
device, and in improving certain characteristics 
of the device itself. These are currently under 
investigation. 
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Introduction 

The Convair-Astronautics Time Delay 
System is being developed to make pos- 
sible the delay of analog functions over 
a greater range than is possible with 
more conventional means. The system is 
designed to handle ten channels of analog 
data, delay each for the same time inter- 
val, and reproduce the functions within 
0.1% of the original composition for all 
frequencies up to 100 cycles per second. 
The delay is required to be variable from 
0.01 to 10 seconds, +5 milliseconds. 

This paper is concerned with the system 
design and will not attempt to discuss 
applications. 

In order to obtain the wide delay 
range required, a special magnetic tape 
system is employed. The delay is 
achieved by recording information on the 
tape with a recording head, and reading 
the information back at the prescribed 
time with a separate head. The tape 
travels at a uniform rate in a contin- 
uous loop. The delay is therefore 
directly related to the length of tape 
between the write and read heads. This is 
controlled by a servo system which con- 
tinuously monitors this "delay" loop and 
compares it with a value preset directly 
in milliseconds. The high degree of ac- 
curacy is obtained by converting the ana- 
log input voltages to digital form and 
recording It. A single analog- to digital 
converter is used for this purpose with 
the various analog Inputs being commuta- 
ted to the converter. The read head 
retrieves the digital information from the 
tape after the desired delay. The infor- 
mation is then re-comnutated to ten 
digital-to-analog converters. The use of 
separate digital-to-analog converters for 
each input-output channel eliminates the 
need for holding amplifiers and commuta- 
tion of the analog output voltages. Cor- 
relation between input and output commuta- 


tion and delay control is achieved 
through a central timing system. The 
overall flow of information is shown in 
Figure 1. 

Tape Transport 

The tape transport is designed to 
use a 100 foot continuous loop, one inch 
magnetic tape. Information is recorded 
non-return to zero on the tape by a 14 
channel recording head designed to handle 
a forty kilocycle repetition rate as the 
tape moves at 100 inches per second. At 
the prescribed time later, this informa- 
tion is retrieved by the read head as the 
tape continues in its loop path. Since a 
delay of from 0.01 to 10 seconds is re- 
quired, the amount of tape between the 
recording and the reading heads must be 
variable from one to one thousand inches. 
This tape is loosely piled in the tape 
delay storage compartment. The tape is 
pushed over the recording head where air 
pressure assures contact with the head. 
This is necessary since, to achieve the 
minimum one inch loop, it is impractical 
to place the drive capstan between the 
recording and reading heads. If the 
delay loop exceeds four inches, the tape 
is picked up below the recording head by 
a vacuum capstan which helps maintain 
tension in the tape across the recording 
head. with a delay loop of less than 
four inches, the tape pulls free of the 
vacuum capstan and its own centrifugal 
force supplies the required tension. The 
tape, after passing the read head, moves 
through a compartment similiar to the 
delay storage compartment where all tape 
not used in the delay loop Is stored. 
Vacuum is used to maintain tension in 
the tape wherever it is necessary. The 
principle features of the tape unit are 
shown in Figure 2. 
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Timing 

As shown In Figure 3, the heart of 
the timing system is a 960 kc crystal 
clock which generates the fundamental 
frequency used by the digitizer for bit 
selection. This frequency is divided 
by twelve to obtain an 80 kc rate used 
by the delay servo counter. Dividing 
again by two gives the fundamental l+O kc 
sampling rate. A ten bit shift register 
uses this rate for input and output 
commutation and channel identification. 

Delay Servo 

The tape transport uses two three- 
phase synchronous motors, one to drive 
the write and vacuum capstans and the 
other to drive the read capstan. The 
first motor is driven directly from the 
60 cycle three phase line while the read 
motor is driven by a variable frequency 
(three-phase) oscillator. The frequency 
of this oscillator is controlled by a 
voltage level corresponding to the out- 
put of a comparator. The recording head 
records the Lf.0 kc clock on the tape and, 
at the same time, adds it into a counter. 
The read head subtracts this I4.O kc signal 
out of the same counter so that the bal- 
ance in the counter represents the dif- 
ference between the counts recorded and 
the counts read. This balance is pro- 
portional to the delay in the system and 
and is continuously compared against a 
preset count representing the delay re- 
quired, set directly in milliseconds. 

The output of the comparator then regu- 
lates the speed of the read motor as 
discussed. This system is shown in 
Figure if. 

Analog-To-Dlgltal Conversion 

The analog-to-digital converter 
samples the input analog voltage and con- 
verts it to digital form using ten bits 
for amplitude representation plus one bit 
to indicate polarity (sign). To obtain 
the required accuracy, each channel of 
input information is sampled at a repeti- 
tion rate of four thousand samples per 
second. Since a single digitizer is used 
for all ten channels of analog inputs, 
the overall sampling rate of the digit- 
izer is forty thousand samples per second. 
A ten bit shift register driven by the 
forty kilocycle clock is used to activate 
diode bridge switches for sequential input 
commutation. The digitizer consists of 
eleven flip-flop operated bit switches 
which contribute current through cali- 
brated resistors to a summing junction to 
which the input signal is also connected. 
The polarity of this summing junction is 
the normal carrier signal input to a 


"ring modulator". The signal input of 
the "ring modulator" is the 960 kc clock 
frequency. The output is a 960 kc signal 
which leads or lags the clock by 90 de- 
grees depending on the polarity of the 
summing junction. By comparing the phase 
of the clock with the modulator output, 
the summing junction's polarity is deter- 
mined. 

The common "ripple -down" method of 
digitizing is used. Starting with the 
most significant bit, the bit switches 
are turned on sequentially by an eleven 
bit shift register driven by the 960 
kilocycle clock. Each bit switch acti- 
vates a zener diode regulated, negative 
voltage source which is summed through 
a calibrated resister to the summing 
junction. The current contributed by 
each bit is proportional to the signif- 
icance of the bit. If the summing junc- 
tion goes negative, the bit reset gates 
are activated so that when the next bit 
switch is turned on, the proceeding 
switch is turned off. In the event the 
input analog voltage is negative, the 
sign bit is activated which contributes 
a positive current to the summing junc- 
tion of a magnitude equal to that con- 
tributed by a maximum input voltage. The 
effect of the negative input voltage and 
the sign bit is a current equal to a 
positive input which is the complement of 
the absolute value of the input signal. 

As a result, the digitizer will always 
handle a positive input voltage. When 
all ten bits have been sampled a "trans- 
fer" pulse is generated by the eleventh 
bit of the shift register. This pulse is 
"and" gated with each bit switch through 
eleven transfer gates so that each trans- 
fer gate corresponding to each "on" bit 
switch will pass the pulse. The output of 
each transfer gate is coupled to a sym- 
metrically driven flip-flop which serves 
as a write head driver amplifier. The 
eleven channels of digital information are 
thus recorded in parallel on eleven of the 
fourteen channels of the tape. A change 
in direction of saturation of the tape 
corresponds to a "one" being recorded, 
while no change represents a "zero". 

Figure 5 is a block diagram of this sys- 
tem. Figure 6 is a time versus sequential 
operation chart of the system. 

Digital-To-Analog Conversion 

This section, shown in Figure 7, in- 
cludes fourteen read amplifiers, eleven 
of which receive the digital information 
from the tape; ten groups of eleven out- 
put commutation gates; a ten bit shift 
register; and ten digital-to-analog con- 
verters each consisting of eleven bit 
switches, gates, and a summing amplifier 
with a low pass filter. The outputs from 



the eleven read amplifiers (pulse for a 
"one”, no pulse for a "zero") are gated 
to the proper output switches by the 10 
bit shift register. This shift register 
is started in the output channel No. 1 
position by a pulse from the tape which 
indicates that input channel No. 1 is 
being read, and is then advanced by the 
recorded forty kilocycle clock as the 
other input channels are read. The out- 
put bit switches are flip-flops which 
control diode bridge output gates. These 
gates apply a standard reference voltage 
through a calibrated resistance to the 
summing junction of the output amplifier. 
The summation of all the contributions 
from the on gates reproduces the analog 
equivalent of the digital information 
received from the tape. 

Additional Features 

The limits on the delay of 0.01 to 
10 seconds have effectively been expanded 
by use of a "recirculate" provision, mak- 
ing it possible to transfer information 
from any output channel back to any input 
channel without cumulative errors. This 
is done by making available at a patch 
panel all bit control flip-flop outputs 
which may be gated directly back to the 
write head driver amplifiers so that they 
are re-recorded without any conversions 
in place of the usual input information. 
This enables a single channel of contin- 
uous information to be delayed up to 100 
seconds by recirculating the information 
through all ten channels successively. 
Many variations are possible which make 
the recirculate feature a versatile in- 
strument. This system is completely 
transistorized. Printed circuit boards 
are mounted in slide-out racks making 
it possible to check any circuit during 
operation. Major system check points 
have been brought out to a test jack 
panel enabling checkout with a minimum of 
effort. 



FIGURE I 

INFORMATION FLOW DIAGRAM 
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TIMING 





FIGURE 2 

TAPE TRANSPORT 



FIGURE 3 

CENTRAL TIDING SYSTEM 
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ANALOG TO DIGITAL CONVERSION 
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FIGURE 7 
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DAFT— A DIGITAL/ANALOG FUNCTION TABLE 

by 

R. M. Beck and J. M. Mitchell 

Packard Bell Computer Corporation 
Los Angeles, California 


Abstract 


The design of a digital function generator 
based on incremental digital computer techniques 
is described which is capable of generating both 
mathematical functions and arbitrary functions. 
The mathematical functions are generated using 
parallel Digital Differential Analyzer methods. 
The arbitrary functions are obtained by generat- 
ing interpolating curves through programmed 
numerical points. The mathematical functions 
can be generated with a high degree of accuracy 
and repeatability while the arbitrary functions 
can be generated to a high degree of repeat- 
ability but an accuracy limited by second order 
interpolation methods. 

The interpolating equation generates a 
curve made up of a chain of parabolic segments 
passing through thirty-two numerical points 
which may be unequally spaced along the X axis. 
Both second order and first order interpolation 
may be performed. 

Fourteen bit incremental Analog-to-Digital 
and Digital-to-Analog converters are included 
which enables the use of the DAFT system in con- 
junction with analog computer facilities. A 
3 megacycle clock rate is used, enabling the 
transfer of information at 100,000 increments 
per second allowing for very high speed 
computation. 


Introduction 


This paper describes the design for a set 
of incremental computer blocks. The basic 
blocks of this system are shown in Figure 1. 

The incremental inputs and outputs of a number 
of these blocks may be interconnected to solve 
a large class of research and development 
problems. The rate of transfer of information 
between such blocks is 100,000 increments per 
second which allows for very high speed computa- 
tion. The operation of the blocks is numerical 
to allow for very high computational accuracy. 
This form of computer system should prove 
particularly useful for solving those complex 
problems in which present analog computers 
exhibit prohibitive total errors. 

The Integrator, Variable Multiplier, 
Constant Multiplier and Servo Units are of the 
Digital Differential Analyzer type. The DAFT 
Unit is a Digital/ Analog Function Table. The 


Analog-to-Digital and Digital-to-Analog 
Converter Units operate incrementally to convert 
between voltage information and digital informa- 
tion. These converters provide communication 
links between the incremental computer and 
either an analog computer or a process control 
system. The operation of these interconnected 
blocks is controlled by a central Control Unit. 
This Control Unit serves to enter problem para- 
meters, monitor the operation of the system, 
and synchronize the system operation. This 
equipment can be used to generate many mathemat- 
ical functions such as shown in Figure 2. 

When a function cannot be expressed 
satisfactorily in an analytic form, the DAFT 
unit may be used to generate it as an arbitrary 
function as shown in Figure 3« The mathematical 
functions can be generated with a high degree of 
accuracy while the arbitrary functions can be 
generated to a high degree of repeatability but 
an accuracy limited by second order interpolation 
methods. 


DAFT Function Generator 


General 


One particular interconnection of incre- 
mental units (see Figure 3) will be given 
special attention in this report. This system 
of units accepts an input voltage, X, generates 
a function, f(X), from a numerical function 
storage and generates Z = f(X) as an output 
voltage. In this way an analog function genera- 
tor is obtained with a very high degree of 
repeatability. 


Z - f(X) 


X (Input Voltage) 



(Output 

Voltage) 



A to D 

dX 

DAFT 

3f(X) 

D to A 



Converter 


Unit 


Converter 



Fig. 3 Arbitrary Function Generator 
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Incremental Technique 


It is characteristic of incremental computers 
to generate fewer dZ output increments than dX 
increments. If the X scale of the graph, 

Z = f(X), is divided into 16,384 dX increments 
and the Z scale is divided into 16,384 dZ 
increments, the Z function cannot generally 
extend over this full scale. Figure 4 indicates 
the range of Z for several types of functions. 


16,384 dX's 

A 



16,384 dZ's 


Fig. 4 Incrementally Generated Curves 

When the curve is Z = X, the curve can 
extend over 16,384 dZ's. The maximum excursions 
for other curves are given below: 


Yz cycle sine 

1 cycle sine 

2 cycle sine 
4 cycle sine 
8 cycle sine 


curve 10,420 dZ's 

curve 5,210 dZ's 

curve 2,605 dZ's 

curve 1,302 dZ's 

curve 651 dZ's 


This does not mean that the output amplitude 
necessarily becomes reduced with higher frequency 
curves (the output amplitude can be scaled up at 
either the D to A converter or by an output 
amplifier), but that the resolution between out- 
put values becomes coarser. 


scale may be visualized as divided by 129 grid 
lines with a spacing of 128 dX's between lines. 
The values of Z at any 31 of these grid lines 
may be used to define the curve provided that no 
more than 15 grid lines are skipped as a group. 
It is assumed that the first and last grid lines 
are always used to give 33 values of Z or 3 2 
AX's and 32 AZ's. 

The Z values are measured in units of dZ 
increments. The values used for Z and X must be 
chosen subject to some restrictions. 

The first restriction is that AZ^/AX^ 

= n^/ 128 , where n^ is an integer from -128 to 
+127. This states that slope, AZ^/ AX^, is 

restricted to one of 256 different values (since 

AX. = 128 N. , n. = AZ./N. ). 

1 1111 


The second restriction is that one of the 
X-Z points that is chosen be the origin. This 
requires that curves which do not pass through 
the origin be displaced vertically be the con- 
stant, f(0). This constant can be readily 
added back to the generated curve in the analog 
computer as well as in the incremental computer. 


The third restriction is that the change of 
slope between AX increments be such that 

n. n - n. = m. where m. is an integer from 
1 +1 1 1 1 


-128 to + 127 . 


The fourth restriction is thatj n^ - Yz 
128 which may also be expressed as 1 3/2 n. 


<128. This is necessary to limit 


- ' k n i ♦ 1 

the slope used in parabolic interpolation and 
will be mentioned later. Note that if the other 
restrictions are satisfied this restriction will 
be satisfied whenever I n . | < 64. 


Interpolation Method . 

The interpolation formula to be used in the 
region X^, is the following: 


The scaling of X at 16,384 dX's is consis- 
tent with the resolution of a high precision 
converter and is chosen to give a fine resolution 
for the output generation. 

Numerical Definition of Z = f(X) Curve 



The curve to be definedis to be divided 
ij».to 32 AX increments along the X scale. Each 
AX ± will have (128)(N^) of dX increments, 

where N^ is an integer from 1 to 16. The nominal 

value for N. is 4 so that the nominal value of 
1 

AX^ is 512 dX's. Individual AA 's may be one- 

fourth the nominal or four times the nominal, 
but the sum of the 32 N^ ' s should be 128 so as 

to span the full scale of 16,384 dX's. The X 


n. 

K \ 

/ x - x i-iV 

its 


Ml 


dX-/z- 



dX. 


Figure 5 shows the interpolation curves for 
Z in the intervals, AX^ and AX^ 
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X o AX 1 x ± AX 2 x 2 A x 3 


Fig. 5 Interpolation Curves 


In general, the interpolation formula yields 
a curve that is a chain of parabolic sections. 

The parabola used in the increment A)k passes 

through three points: 


2 ) 

3) 


X i’ 


Z. 

x 


x . +AX., 


z. 

i 


+ 


Az 

fix 


i + 1 
i + 1 


A x ± 


This linearly interpolated third point is 

much easier to use than ( x . , Z. , >. for the 

1+1’ l+l) 

third point of the parabola. 

The interpolation system is indicated by 
the block diagram in Figure 6. The dX incre- 
ments, of which there are 128 N. as x varies 

l 

through A x i? are scaled down in number by 1/N_^ 

to give 128 increments perA x ^. The increments 

are scaled down further in number by 1/128 to 
give one increment per A x ^. These i increments 

are summed in the i counter to control the 
selection of m. and Ik from the storage. 

As x varies through A)k, is added into 

the A register 128 times. The A register has 
64 in it at the beginning of A x ^ (this is 

1000000 in binary). After nu has been added 

into the A register 128 times, the A register 
will have overflowed nu times and again have 64 

in it. The overflows of the A register are 
summed in the B register. At the beginning of 
eachA)k increment, the B register contains the 

value, n. , which is increased by m. to n. +1 at 
l i i 

the end of theA x . increment. 


While x varies through the A)k increment, 
m. is added into the C register 128 N. times. 

l i 

The C register operates much like the A register, 
beginning and ending with 64 in it, but overflow- 
ing (N.)(m.) times. These overflows are scaled 
° l l 

down by -Yz and slimmed with the overflows from 
the D register to obtain the dZ output increments. 

While X varies through the AX. increment, 
the arying value in the B Register is added into 
the D register 128 Ik times. The value added from 

the B register into the D register corresponds to 
n i + m i ^ X-X i 1/128N. * from the interpolation 

formula. The output from the -Yz scaler added 
into the D register corresponds to -Yz ( nn ) d x 

from the interpolation formula. By taking the 
overflows for the sum of these terms from the 
7-bit D register, we obtain a 1/128 factor or 


dZ= 


its 


n. + m. 


'i m 


dX- ^ Yz 


m. 

l 


d x 


The DAFT Unit operates as a numerically 
drift-free interpolator which traces out a 
smooth curve through the Z^ points regardless of 

the number of cycles of the X input voltage. 

This curve will have a point by point uniformity 
for all cycles of X, 

When linear interpolation is desired, the 
overflow signals from the C register may be used 
as the d Z output and the A, B and D registers 
not be used. The curve traced in this case will 
be a chain of straight line segments joining the 
Z^ points. The values of n^ will be taken from 
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Z X 



Fig. 6 Interpolator Block Diagram 
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Fig. 7 Patchboard Storage Circuit 
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storage rather than nu . Only the first two 

restrictions on the X-Z points will apply in this 
case. 


Method of Operation 

The design of the system is arranged for 
ease of operation and testing. To set the 
function generator system into operation, the 
function patchboard is inserted into the DAFT 
Unit and the Control Unit Reset button pressed. 
Pressing this button does the following: 

1) Clears the X and Z counters in 
the converters to zero. 

2) Transfers the origin values of 
n and i from the patchboard in- 
to the B register and i counter 
of the interpolator. 

3) Sets 64 into the A, C and D 
registers and clears the Scalers 
of the interpolator. 

When the Control Unit is switched into 
computation, the function generator system pro- 
ceeds to generate the curve Z = f(X) from the 
origin to the current value of the X input 
voltage in 0.082 seconds or less. The function 
generator will then track on the variations in 
X until the Control Unit is switched out of 
computation. When the system is switched out of 
computation, the values of X and Z are locked on 
the last computed point of the curve. When 
computation is restarted, the generator will slew 
at the rate of 100,000 dX's per second to the 
current value of X. 

To perform a quick test for errors in the 
operation of the generator, the push button to 
set the X input to zero may be pressed. The X 
and Z indicators should both slew to a zero read- 
ing when this button is pressed and back to the 
current (X,Z) point when this button is released. 

To verify the operation of a new function 
patchboard, the manually controlled X input 
voltage may be used to obtain the (X., Z. ) read- 

l l 

ings on the indicator lamps that are prescribed 
for the function. When the readings are in- 
correct, the patchboard can be corrected and the 
revised data verified directly. 


Circuits 

All circuits are designed using germanium 
transistors and diodes. No tubes or mechanical 
choppers are used. All components are used at a 
conservative power and tolerance rating. The 
design goal is to achieve very reliable operation 
at low power consumption. 


The converter units use the circuits 
developed for the Packard Bell Computer Corpora- 
tion MULTIVERTER equipment. 

The circuits used for the DAFT interpolator 
are the circuits used in the Packard Bell 
Computer Corporation TRICE computer. These 
circuits consist of transistor flip flops, diode 
gates, and distributed delay line storage 
elements. 

Figure 7 shows the gating circuit designed 
for the Patchboard Storage Unit. This circuit 
has the moderate speed requirement of obtaining 
a new set of values on its 12 output lines with- 
in five microseconds after the i counter changes. 
One of 32 diode-and-gates transmits a DC signal 
depending on the state of the i counter. This 
signal is sent through an emitter follower at an 
8-volt level to a 12-contact common of the patch- 
board. The signal is connected or disconnected 
with patchboard jumper to twelve diode-Or-gates. 
The twelve Or-gate lines present the interpolator 
with the iik and n^ values in parallel. 

When the Reset Signal comes from the Control 
Unit, it is used to block the reading of m^ and 

n. values and introduce the value for n at the 

l 

origin on the Or-gate lines. This signal is also 
used to transfer the value for i at the origin to 
the i counter flip flops. 


X Frequency Response and Accuracy 

The DAFT Unit has a maximum operation rate 
of 100,000 dX increments per second. This rate 
is chosen to be compatible with the other units 
of the system. When the X scale is divided into 
16,384 dX increments, the value of X for a 
sinusoidal input of frequency, f, is given by 

X = (16,384/2) sin 2 it ft 

and the maximum rate of change of X is 

(dX/dtJ ax = i6,384tt f. 

Therefore, the system will track if the maximum 
full scale input frequency is 

f = 100, 000/16, 384TT = 1.95 cps. 

When the tracking frequency of 1.95 cps is 
exceeded by a signal of, say, 3 cps, the Analog- 
to-Digital Converter fails to follow closely the 
steep portions of the input curve. Figure 11 
indicates the discrepancy that will occur in this 
case. The result is approximately a triangular 
wave with 5% amplitude reduction, a maximum 
error of 21% of full scale and a phase shift of 
20°. The function generator continues to function 
properly but for the distorted input curve. 


The described system units can be used at 
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higher frequencies by reducing the number of dX's 
per full scale of X. For example, the X input 
resistor of the Analog-to-Digital Converter can 
be changed to give 8,192 dX's per full scale of 
X and the sum of the N. values reduces to 64. 

l 

The maximum full scale input frequency can then 
be 3.9 cps. The number of dX's for full scale 
can be further reduced to 4096 and all 32 N_^ 

values restricted to l's. This will accomodate 
a full scale input frequency of 7«8 cps. 

In general, the following relationship 
exists between frequency and the number of dX's: 

(number of dX's/full scale of X) = 100,000/Tff 

When the curve, Z = f(X), is characterized 
as a sine curve of C cycles, 

(number of dZ 's/full scale of Z) 

= (l/HC) (number of dX's/full scale of X) 

Therefore, 

(number of dZ' s/full scale of Z) 

= ioo,ooo/rr 2 cf = 10, 132/c f 

Where the output is expressed as a binary 
number with p binary stages used, the formula 
may be written as 

2 P = 10,132/Cf 

This last equation essentially defines the 
frequency-accuracy product of the DAFT function 
generator system. 

As an example, if the input frequency is 

3 cps and the Z = f(X) curve resembles 2 cycles 

of a sine curve, the generator can be set up to 

give an output curve that is smooth to one part 

. .10 
in 2 . 

~No. of dX's = 100,000/TT (3) ~ 10,496 = (82) (128) 

Tn. =82 

' £ — i 

No. of dZ's = 10,496 /tT (2) = 2 10 

for C = 2 cycles and f = 3 cps 

For another example, if the input frequency 
is 30 cps and the Z = f(X) curve resembles one 
cycle of a sine curve, the generator can be set 
im to give an output curve that is smooth to 

g 

one part in 2 . Only eight AX increments may be 
used for this input frequency. 

No. of dX's = 100,000/IT (30) ~ 1024 = 8(128) 

V N. = 8 
_ ^—1 


Q 

No. of dZ's = 1024/ TT (l) ~ 2 


| for C, = 1 cycle and f = 30 cps 

In order to obtain more AX. increments for 

l 

high frequency inputs, the DAFT Unit will have 
one other scaling arrangement: 

1) The 1/128 Scaler will be changed to a 
1/16 Scaler 

2) The B register will be shortened to 
4 bits plus sign. 

3) The number of dX's/ AX^ = (N^)(l6) 

4) n^ will be an integer from -16 to +15 

5) nu will be an integer from -16 to +15 

6) |n^ - /aiu |< 16 

The last example may then have all 3 2 X 

increments used 

No. of dX's = 100,000/fT(30) ~ 1024 = 64(l6) 

J N =64 

O 

No. of dZ’s = 1024/ Tf (1) ~ 2 

for C = 1 cycle and f = 30 cps 


Summary 

The set of units described provide a flex- 
ible, high performance system with which arbi- 
trary functions and mathematical functions can 
be obtained at high speed and high accuracy. The 
arbitrary functions are set up in numerical form 
on a patchboard and generated with first or 
second order interpolation. The mathematical 
functions are generated in their analytic form 
with interconnected Digital Differential Analyzer 
units. The performance of the system in 
generating both types of functions is approxi- 
mated by the formula 

2 P = 10,132/Cf 

where 

p = number of binary bits precision 
(including sign) in the output, 

C = The complexity of the function 
measured in terms of the number 
of sine curve cycles it contains, 

f = the full amplitude input frequency 
of the independent variable input. 

The independent variable may be followed 
to a precision of one part in 16,384 and the 
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output variable may be generated up to a preci- 
sion of one part in 10,000. The system operates 
at the rate of 100,000 cps so that in the worst 
case, the delay between a variation in the input 
and a corresponding variation in the output will 
be much less than 50 microseconds. 

Another type of function not treated 
explicitly in the body of this report is that of 
generating an analytic function of time such as 

Z = sin 2TT ft 

This function can be generated with a 
performance formula of 

2 P = 100,000/ f, 

where p is the number of bits precision (includ- 
ing sign) in the output. The value of p is 
limited to 14 or less by the output converter. 
For f of 64 cps the sine can be generated to 8 
bits plus sign and for f of 200 cps the sine 
can be generated to 6 bits plus sign. These 
curves can be generated for an unlimited number 
of cycles without amplitude drift. 


X 



Fig. 8 Slope - Limited Tracking Curve 
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MATHEMATICAL APPLICATIONS OF THE 
DYNAMIC STORAGE ANALOG COMPUTER 


Jack M. Andrews 
Computer Systems, Inc. 
6 1 1 Broadway 
New York 12, N. Y. 


Many recent developments in analog 
computers have substantially increased their 
versatility and speed. Probably the two 
most important of these innovations is the 
automatic repetitive operation at frequen- 
cies of 60 cps and the development of high 
speed memory with a tracking error of less 
than 10 microseconds. With repetitive op- 
eration and the simultaneous solution of 
large matrices of ordinary differential 
equations at speeds of 60 cps, the analog 
is much faster in operation than the digital. 
However, an inherent disadvantage is the 
size of computer required for a complex 
problem. In general, the more complex or 
larger a problem, the more conventional 
analog computer equipment that has been 
required for its solution. The digital com- 
puter, on the other hand, has, from its 
earliest inception, employed memory to allow 
time sharing of components. In this manner, 
a large complex problem could be solved by 
time sharing the digital computer equipment 
and using the same equipment repeatedly to 
solve the larger scale problems. Therefore, 
large problems required longer solution 
times rather than larger computers. The 
disadvantage of larger sizes in the analog 
computer has been overcome by the develop- 
ment of high speed memory which is employed 
in a manner similar to that utilized in the 
digital computer for time sharing to in- 
crease the time of solution rather than 
increasing the size of the computer for the 
more complex problems. However, the order 
of magnitude increase in solution time is 
much less severe with the analog computer 
employing memory than it is with the conven- 
tional digital computer, since the analog 
simultaneously solves systems of equations, 
while the digital computer must use reitera- 
tive procedures for such a system. 


SUMMARY 

The development of the high-speed 
analog memory has made possible sequential 
calculations through time sharing of the 


computer components. One of the most striking 
comparisons is found in the process industry, 
where distillation is a critical test of the 
computing capabilities of both the analog and 
digital machines. Prior to the development of 
high-speed repetitive operations and dynamic 
storage analog computers, a four component 
distillation problem with overhead, sidestream, 
and bottoms products would require in excess of 
1100 amplifiers for the simultaneous solution 
of a 30 plate column, or conversely, a solu- 
tion time of several hours. With the modern 
DYSTAC Analog Computer in repetitive opera- 
tion, such a solution can be obtained with 
only 43 amplifiers in approximately 1 minute 
of operating time. This is a very favorable 
comparison with the more expensive digital 
computers. 

Memory can also be employed through 
high speed sequential calculations to obtain 
definite integrals for automated, self-solu- 
tion of various problems. One of the most 
important of these in industry is the opti- 
mization of a definite integral with respect 
to one or more independent variables. Pro- 
bably the most common example is the reali- 
zation of an economic optimization for an 
operating process. However, other criteria 
can be employed for optimization such as the 
least mean squares fit of data in correlation 
schemes or the minimization of transients 
with respect to design parameters in an 
instrument control system. 

Furthermore, many partial s can be 
reduced to multiple definite integrations. 

Under these conditions a combination of 
repetitive operations and non-repet itive 
operations (real-time) is employed together 
with the continuous or point memories to 
provide a simple, accurate, and rapid solu- 
tion for multiple integration. 

The variables are not generally separ- 
ated in partial differential equations. 

Where they can be separated mathematically, 
a multiple integration technique, similar to 
that just discussed, is readily employed to 
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obtain a solution to the proboem. Where the 
partial s cannot be separated mathematically, 
the difficulty generally lies in a resolu- 
tion of the initial conditions or boundary 
conditions. In the latter circumstance, a 
forced balance solution which is readily 
incorporated into the analog computer may 
be of assistance in arriving at a solution 
of the problem. 

The development of memory for the 
analog computer has made it possible to 
incorporate many digital techniques into 
the current analog procedures. Continuous 
memory for the analog computer is an adapta- 
tion of digital techniques employing New- 
ton's Divided Difference Interpolation 
Formula. The employment of continuous 
memory immediately provides solutions for 
various types of transient operations and 
difference-differential equations problems. 
The ordinary solution time of even the most 
complex problems in this category is about 
20 seconds. 

Other applications which have stemmed 
directly from the repetitive operations and 
memory include an all electronic transport 
delay which is derived from standard com- 
puter components and differentiation with- 
out the noise that has heretofore been 
associated with such an operation on analog 
computers. 

DISCUSSION 


ard integrator in both non-repetitive oper- 
ation or real time will be discussed first. 
The circuitry of such an integrator is 
generally similar to that presented in 
Figure 1. 

In Figure 1, the high gain amplifier 
Is shown with the feedback capacitor (C) 
and input resistor (R|) which constitute 
a standard non-repetitive operating Integra 
tor. In order to reset the integrator to 
the applied initial condition voltage, 
both relays A and B are required. During 
the reset period of non-repetitive opera- 
tion, relay A Is open and relay B is closed 
During this period, the capacitor is 
biased to produce at the output of the 
Integrator a value of Eq, which is the 
negative value of the initial condition 
imposed on the initial condition resistor 
network. When the operate cycle is initi- 
ated, relay B is opened and relay A closes, 
so that the value of the voltage input E| 
can be integrated. The time- integrated 
value of E| from the appropriate initial 
condition is then available as the output 
voltage Eq. 

If the integrated value of Ej is a 
function of an independent variable, x, 
then Eq may be expressed as F(x). Figure 
2 shows the variation in the output volt- 
age Eq for a decaying function during 
the operate and reset cycles. 


DYNAMIC MEMORY 


In order to develop the theory of 
dynamic memory, the performance of a stand- 
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At the beginning of the operate cycle, 
the value of Eq is equal to the initial con- 
dition. During this subsequent integration, 
the value of Eq follows the time integral 
of x which corresponds to F(x). At the 
termination of the operate cycle, the 
computer is returned to the reset period 
where the value E Q changes to correspond 
to the initial conditions. 

If another integrator is in reset 
(relay B closed) and has no input other 
than F(x) through the initial condition 
channel, it will track F(x) and have for 
an output voltage the value -F(x) as shown 
in Figure 3. If the B relay is caused to 
open at any value (xj) of the variable x, 
then the output will subsequently remain 
constant at Fj(x) for all values of x 
greater than Xj. During the reset period, 
if x is less than xj , the memory will track 
the resetting of F(x). 

In this manner, by eliminating the 
normal input voltages and using only the 
initial condition input, memory may be 
incorporated into a standard integrator. 


While this procedure might be followed with 
any high-precision analog computer, the long 
RC time constant of the reset circuit and 
the slow operating speed of the B relay 
(one millisecond or greater) would limit 
its application to non-repet i tive solutions 
with long time constants. It will be 
shown that practical applications of this 
technique require repetitive operation at 
frequencies of from one to two hundred 
cycles per second, preferably 50 cycles 
per second or greater. Utilization of this 
procedure in repetitive operation is pos- 
sible only when the reset relay B is re- 
placed by an electronic equalizing gate 
or switch which operates at a speed of 
approximately 5 microseconds and resets 
the amplifier independent of the RC time 
constant of the initial condition circuit. 

Four types of memory mode are ordinari- 
ly employed in the circuits necessary for 
complex problem solutions. Shorthand 
symbols for these memory circuits are 
presented in Figure 4. 
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FIGURE 4 

DYSTAC MEMORY SYMBOLS 


The letter M Is utilized to indicate 
that each of the four amplifiers is employed 
in the memory application and is a reminder 
that the normal input channels to these amp- 
lifiers should not be used; the only inputs 
should be through the initial condition 
channel. The standard integrator, termed a 
forward memory and shown in Figure 4. (a), 
"tracks" the function present at the initial 
condition input during the computer reset 
period, "learns" it at the start of the 
operate period and "stores" or holds it 
during the operate cycle. 

In Figure 4. (b) a reverse memory is 
shown, in this application the polarity of 
the control signal is reversed so that the 
electronic reset switch operation is re- 
versed, and the output voltage tracks the 
i.c. voltage during the operate period. 

During the reset period, the output voltage 
is stored at the value last available through 
the initial condition input at the termina- 
tion of the operate cycle. Both of the 
memories in Figures 4. (a) and 4. (b) are 
controlled by the repetitive, automatic 
reset pulses generated by the computer. In 
the last two Figures, 4. (c) and 4. (d), 
the electronic reset switches or equalizer 
gates are actuated by external voltage 
comparators rather than by the repetitive, 
automatic reset pulses. Figure 4. (c) 


shows an amplifier whose electronic switch 
is normally closed until the external com- 
parator is actuated by an independent or 
dependent variable whose value exceeds that 
of an arbitrary voltage Xj. Figure 4. (d) 
shows an integrator memory whose reset 
electronic switch is normally open until 
the comparator is actuated. In the case 
of Figure 4. (c) the output voltage tracks 
the value available at the initial condi- 
tion input until the comparator is actu- 
ated, at which time the output voltage Is 
locked or stored at the instantaneous 
initial condition value. In Figure 4. (d) 
the output voltage is stored and constant 
until the comparator is actuated. At this 
time the output voltage tracks the initial 
condition input. 


SEQUENTIAL CALCULATIONS 

In order to perform high-speed sequen- 
tial operations which are automatically con- 
trolled by the computer, it is necessary to 
have an accounting circuit which records 
the number of steps that have been performed 
and whose output voltage can serve as an 
indication for relay amplifiers and compara- 
tive circuits through which the type and 
complexity of the sequential calculations 
can be regulated. An accounting circuit 
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which provides these features Is presented 
in Figure 5. This circuitry is composed of 
reverse and forward memory together with an 
inverter, summer, relay, and relay amplifier. 
If, for the purposes of discussion, it is 
assumed that M has a value of 30 and A 
has a value ofi , a constant value of one 
vol t wi 1 1 be added to P^j through a summer. 


learned and stored a value of zero which 
will appear as its output. During this 
cycle when P^ has a value of zero, the relay 
will return to its normally closed position 
and the value of P^ plus one volt will ap- 
pear at the initial condition input of the 
reverse memory. Therefore, during the next 
operate cycle the value P^ + ] appearing at 



The relay amplifier will compare +P^ 
with a value of 30 and operate the relay, 
driving it from the normally closed (x) to 
the normally open (0) position when P^ has 
attained an equivalent of 30 volts. Let us 
assume that P^ has reached a value of 30 
volts. This value will be maintained at 
the output of the forward memory during the 
remainder of the current operate cycle since 
the forward memory will be in "store". With 
the arm of the relay at the normally open 
position, a ground will be imposed through 
the initial condition channel of the reverse 
memory which will track this value and "learrl 1 
it at the end of the current cycle. During 
this reset period the reverse memory will 
store the output value of zero and the for- 
ward memory will track and learn it since 
its electronic equalizer gate is closed 
during the reset cycle. In the next opera- 
ting cycle the forward memory will have 


the output of the reverse memory will be 
one volt. This value is in turn stored 
at the output during the reset cycle, 
and the forward memory will learn a value 
of 1 volt which it will maintain during 
the following operate cycle. In this 
manner, P^ will have voltage values vary- 
ing in incremental steps of 1 volt from 
0 to the maximum value of M x . During the 
corresponding cycle the value appearing 
as Pfj + ] at the output of the reverse 
memory will always be 1 volt greater 
until Pm is equal to the maximum voltage 
M x at which time P^ + j will be zero. 

Either P^ or P N+ j may be used as control 
voltages to actuate either relay ampli- 
fiers or electronic switches to control 
the various sequential calculations which 
are incorporated in the cycle from 
PN * 0 to P N = M x . The cycle thus 
effected by this control will be 



124 

4.3 


reiterative and will cycle through the 
various sequential calculations until the 
repetition is halted by the operator. This 
feature provides a very efficient means 
for conducting sequential algebraic cal- 
culations such as are encountered in dis- 
tillation. difference differential equations, 
partial differential equations and in control 
problems where the control variables are 
cycled through a series of operating con- 
ditions or boundary conditions. 

The ratchet circuit shown in Figure 
6 is another example of a highly effective 
memory circuit which finds its most gen- 
eral application in the memorization of 
values from one cycle to be used in the 
succeeding cycle. 


However, as discussed in a technical data 
sheet by Computer Systems, Inc. entitled 
"Application of DYSTAC to a Multi-Compo- 
nent Distillation" this new analog develop- 
ment provides a rapid and economic solu- 
tion of mul ti -component distillation prob- 
lems by employing both the ratchet and 
accounting circuits for time sharing of 
computer components. The size of the 
computer is independent of the number of 
plates and is dependent only on the num- 
ber of components to be distilled and the 
number of product withdrawal points. Since 
the procedure is perfectly general, any 
desired specifications can be selected for 
a given case within the prescribed degrees 
of freedom. The solution is based on a 
trial and error procedure with a continuous 



X N+1 



RATCHET CIRCUIT 
FIGURE 6 


The forward memory will produce a 
value of xn which will be utilized through- 
out an entire cycle to calculate the value 
of x ^+1 that is necessary for the follow- 
ing cycle. This value, xn+ 1 » is derived 
from the associated problem circuitry, and 
will be memorized during the reset period 
to provide x N+ j for the calculation of 

x N+2, etc. 

Distillation is one of the most 
severe tests of the capabilities of either 
a digital or analog computer to solve 
sequential calculations. Before the 
development of DYSTAC, the typical distil- 
lation problem was either too expensive or 
too tedious to solve on the analog com- 
puter, and the digital computer was em- 
ployed almost entirely in the distillation 
calculations of the process industries. 


solution frequency of 60 plates per second. 
A solution with an accuracy of 0.1% can be 
obtained for a given case in about 1 minute 
after the basic data has been set in the 
computer. 

Only 43 operational amplifiers are 
required for a four component distillation 
with overhead, sidestream, and bottoms 
withdrawal products. This contrasts with 
a requirement In excess of 1100 amplifiers 
for conventional analog computer equipment 
wherein the entire column Is solved simul- 
taneously. If more than 30 plates are 
required, the amplifier requirement is 
proportionately increased. In the current 
application, the number of amplifiers is 
independent of the number of plates be- 
cause the various electronic components 
are time hhared so that from cycle to cycle 
each component is used repeatedly in suc- 
cessive solutions for the various plates. 
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THE DEFINITE INTEGRAL 

In many problems It Is desirable to 
evaluate In repetitive operations at defi- 
nite and sometimes changing limits the 
Integral of various quantities. Typical 
examples include economic optimization, 
the minimization of transients through 
design parameters in a control system, and 
the evaluation on a ‘least mean squares 
basis of the proper correlation coeffici- 
ents. 

The evaluation of a definite integral 
by the employment of X and 0 memories in 
repetitive operation is illustrated in 
Figure 7» 


indefinite Integral as it is produced in 
repetitive operation mode until x is equal 
to a. At this point, the X memory locks 
and the 0 memory begins to track with the 
result that the 0 memory has as its output 
the previous evaluation of the definite 
integral from zero to a from each preceding 
cycle. It is quite feasible to vary the 
limit of integration a from one cycle to 
another. In this case it would be provided 
by some other portion of the circuitry in 
accordance with the desired problem solution. 

When it is desired to obtain the eco- 
nomic optimum operating conditions for a 
process it is generally necessary to evalu- 
ate dffinite integrations. A general out- 



FIGURE 7 


in this figure, it is desired to 
determine the definite integral of f(x,y) 
with respect to the variable x. The 
desired function is the drive for inte- 
grator 1, wherein the proper initial 
condi t ion, y<?,is also available, and the 
output of integrator 1 is the indefinite 
integral with respect to x. In this ap- 
plication, integrator 1 will be operated 
in the repetitive mode. Integrator 2 will 
also be operated in repetitive mode, and 
its purpose is to generate a ramp function 
for the evaluation of the independent 
variable, x. x is compared with a value 
a from reference voltage to establish the 
upper limit of the desired definite in- 
tegral. The X memory will track the 


line of the procedure for optimization 
fol lows. 

First, the equation for economic 
evaluation of the products and cost of 
operating variables under investigation 
is derived. This will probably contain 
definite integrals as In the case of the 
products from a fixed bed catalytic 
reactor, wherein the amount of catalyst 
is fixed, although the operating condi- 
tions such as temperature, pressure, 
and feed may be varied for optimization 
purposes. Second, the economic equation 
is differentiated with respect to the 
variables under investigation. Each 
derivation will provide a new and 
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Independent optimization equation which will 
contain derivatives of the original definite 
integrals. The desired optimum is realized 
when the value of each derived equation in 
the matrix has a value of 0. The deriva- 
tives containing definite integrals from 
the second step are to be evaluated in 
repetitive operation by the use of X and 
0 memories as indicated in the analysis of 
Figure 7 * Once the successive derivatives 
have been evaluated by repetitive operation, 
they are employed with the necessary alge- 
braic signs derived from the maximum or 
minimum nature of the calculations to drive 
NRO integrators whose outputs are arbi- 
trarily equated to the respective operating 
conditions. The operating variables as 
produced from the NRO integrators are em- 
ployed in feed-back loops for both the 
REPOP calculation and the simultaneous 
evaluation of the optimization derivatives. 
When the desired optima are reached, the 
NRO integrator drives will be zero, and 
the operating conditions will remain con- 
stant at their respective values. It is 
also quite feasible to place restrictions 
on the operating conditions by the employ- 
ment of X and 0 memories. If any of the 
variables become restricted before their 
respective optima are attained, the remain- 
ing variables will still be optimized em- 
ploying the restricted values for the 
pertinent variables. 


A similar technique can be employed with 
repetitive operation evaluation of definite 
integrals and feed-back loops from non- 
repet it ive operation to evaluate the design 
variables which minimize the transients in 
an instrumentation problem. Another example 
is the evaluation of the correlation coef- 
ficients which provide the best least mean 
squares fit of observed data. 


MULTIPLE INTEGRATION 

There are many instances where 
multiple integration requiring successive 
evaluations of a definite integral is a 
very complex procedure that necessitates 
analog computer techniques more advanced 
than those heretofore available. The fixed 
bed converter with axial and radial tem- 
perature gradients, as proposed by Wilhelm, 
is a classic example of a problem which 
can be solved by the multiple integration 
technique. Although this problem is too 
complex to detail in the current presen- 
tation, the philosophy involved in such 
a calculation can be summarized with a 
trivial case. The multiple integration 
to be described is presented in Figure 8, 
where it is desired to determine the volume 
of a cylinder with varying radius. This 
volume is the double integration of the 
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radius and the length. The radius at the 
wall for any particular point along the 
axis is obtained from a diode function 
generator. The radial integrators operate 
in repetitive mode while the axial integra- 
tors operate in non- repet i tive mode. The 
value of x is obtained from integrator 1 
(non-repet i tive) and fed to the DFG from 
which the instantaneous value of the radius 
at the wall, r wj is obtained, r is obtained 
from the repetitive operating integrator 2 
and is fed through a pot set of ZTT^Jrto 
repetitive integrator 3 which gives the 
cross-sectional area at any point in a 
cycle. The upper limit of this integra- 
tion is fixed by X & 0 memories. The X 
memory tracks until r is equal to the 
radius of the wall, at which point it locks. 
The 0 memory starts tracking when the com- 
parator shows that the wall has been 
reached, and its output is always the pre- 
vious value of the cross sectional area at 
the wall. This, in turn, is directed to 
integrator 4 operating non-repet i tively. 

The output of this last integrator is the 
volume of the cylinder at any axial loca- 
tion, x. 

While the previously described 
multiple integration is trivial, the same 
sequence is followed in developing the 
temperature gradients in a fixed bed 
chemical reactor operating under steady 
state conditions. Vaporized reactants 
are introduced at the inlet to the reactor, 
and the total reaction in the first incre- 
ment of the reactor is integrated radially. 
The total heat released in the first incre- 
ment is also calculated so that both the 
concentration of the reactants and their 
temperature can be calculated for the 
input to the second increment. As in the 
previous illustration, the radial incre- 
ments of the fixed bed reactor are calcu- 
lated in repetitive mode while the axial 
variation is again summed with non- 
repetitive integrators. The radial 
variation of both temperature and concen- 
tration of the individual reactants and 
products is presented continuously in 
the oscilloscope and can be plotted for 
any desired radial point as a function of 
x, or axial length, on an X-Y plotter. 

The increment in reactor length can be 
reduced for each cycle until further 
reductions, do not change the problem 
solution. Under these conditions the 


maximum computer accuracy has been achieved 
and the minimum solution time can be deter- 
mined. in this case, a reactor solution 
time of 20 seconds with a frequency of 
60 cps will provide 1200 increments, or 
slices, through the reactor bed and should 
be sufficient. 

The solution procedure outlined can- 
not be obtained from conventional analog 
computers because the known radial bound- 
ary conditions are split and the radial 
function R must be solved automatically 
by the computer in repetitive operations. 

In this particular problem, the value of 
r (a function of radius, r) is known at 
the wall while ft* is known at the center 
of the tube where the radius, r, is zero. 
This solution is readily accomplished in 
repetitive operations with the DYSTAC 
memories as shown in Figure 9. The 
solution technique reduces the split 
boundary value problem in R to an initial 
condition problem. 
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In Figure 9* Integrators I, 2 and 3 are all 
in the repetitive operations mode. Integra- 
tor 3 produces r, the independent variable 
which is divided into R 1 to furnish one of 
the feedback loops. The other feedback 
loop is provided by -R from summer 1. 

If Rq were known the solution In the 
repetitive operating mode would be com- 
plete. An automated solution is obtained 
by observing that the value for R^ at the 
wall is equal to the initial condition 
value plus the integrated change in R at 
the wall, AR W . In order to determine Rq, 

A R is continuously added to the appro- 
priate value of R w in summer 2. The out- 
put of this summer is not the required 
value of Rq, because there is no subscript 
w on the AR increment. This subscript 
is realized by the employment of X and 0 
memory. The X memory tracks until the 
wall of the cylinder is reached, as deter- 
mined by a comparison of on a pot with 
the independent variable r. When this 
comparison is realized the X memory stops 
tracking, and is locked on the value which 
is then a trial value for Rq The 0 mem- 
ory is locked until a comparison is 
realized, at which time it tracks or 
learns the value in the X memory. In this 
manner, the previously determined trial 


value for Rq Is always available as the 
output of the 0 memory. In the trial 
and error solution sequence, some value 
for Rq will be maintained as the initial 
condition throughout a cycle. From this 
value AR„ can be calculated at the wall, 
and a new value for Rq will be assumed. 

The sequential assumptions will be fol- 
lowed until there is no further change 
with the value for R^, being precisely 
equal to the required value. The solu- 
tion sequence is very rapid. 

In the previously discussed appli- 
cation the independent variables in the 
problem have been separated by the as- 
sumptions on which the problem was 
hypothesized. In the more general case, 
the variables must be separated mathema- 
tically and frequently they cannot be 
separated at all. However, there are 
procedures where both of the latter cases 
can be generally handled satisfactorily 
with the DYSTAC Computer. In the first 
case, the variables are separated mathema- 
tically, and the multiple Integration 
technique can be employed. In the other 
case a form of forced balance can be 
used for continuous self-solution of 
the problem's boundary conditions. 





Variables can frequently be separated 
by a conventional substitution for the 
dependent variable of a product of two 
new dependent variables which are res- 
pectively functions only of the individ- 
ual independent variables. Another means 
of seperation is the employment of 
Laplace transformations when one of the 
variables has been transformed, an ordi- 
nary differential in terms of the trans- 
formed variable is obtained, A solution 
of this ordinary equation will in turn 
yield an explicit evaluation of the 
transformed variable in terms of the 
transform s and the remaining independent 
variable. The final explicit equation 
can be solved using standard instrumen- 
tation techniques to evaluate the trans- 
formation operator in repetitive opera- 
tion while employing values of the 
remaining independent variable generated 
from non-repetit?ve components. 


CONTINUOUS MEMORY 

Point memory and the ratchet circuits 
will solve many problems involving ordi- 
nary and partial differential equations; 
however, there is frequently a require- 
ment for continuous memory of a varying 
dependent or independent function. Al- 
though there are several variations of 
the proposed continuous memory circuit, 
a simple illustration will serve to 
demonstrate its potential applications. 
The memory circuitry to be Investigated 
is shown in Figure 10, 

The most common requirement of con- 
tinuous memory is illustrated by the 
variation of f(x) with the solution 
curves 1, 2, and 3. In this case the 
function FI for f(x) is required as 
input to the computer circuit to predict 
F2, Solution FI will be required for 
one full cycle of the repetitive opera- 
tion during which the values of F2 for 
the subsequent operating cycle will be 
produced. In the subsequent operating 
cycle the values from F2 will be required 
to calculate another curve F3. The 
memory circuit is based upon Newton's 


Forward Interpolation Formula, This 
relationship states that if for equal 
increments of the dependent or indepen- 
dent variable x the values of f(x) are 
known, these values may be used to 
express f(x) as a function of x as 
shown in the polynomial equation. The 
values of the coefficients. A, B, C 
and D, for this equation are derived 
from the differences as shown in the 
example of a difference tabulation. 

Thus, If the values of f(x a ), f(x|j), 
and f(x c ), and f(x d ) are known, they 
may be directed to a summing network 
from which the coefficients. A, B, C 
and D, can be derived. Simultaneously, 
the value of x may be sent to a function 
network which produces the required 
products of x, x-1 , and x-2, that are 
necessary for Newton's Formula. 

One simple scheme whereby the re- 
quired values of f(x) for curve 1 may 
be retained in memory while new values 
for curve 2 are learned for the subse- 
quent cycle is shown in the four pairs of 
X and 0 memories. During the first cycle 
the values of f(x a ), f(x{j), f(x c ) and 
f(xd) for curve F0 are present at the 
outputs of the respective 0 memories; 
these values are then fed to the required 
summing network that produces the coeffi- 
cients, A, B, C and D, which are used in 
the development of the curve FI as x 
varies. At the same time the respective 
X memories have relay comparators which 
open the first memory when x is equal to 

a, the second memory when x is equal to 

b, the third memory when x is equal to 

c, and the fourth memory when x is equal 
to d. Since the X memories are fed by 
the computer with the newly calculated 
durve FI, at the end of the cycle the 
values of f(x a ), f(xfc>), f(x c ), and f(x d ) 
for curve FI are locked in the respec- 
tive X memories. It is necessary to 
translate these values into the 0 mem- 
ories for the subsequent cycle, and this 
is done at the end of the current cycle. 
This is accomplished by one common 
comparator circuit that switches all 
four 0 memories when the value of x is 
equal to » which is a value greater than 
x d . This comparator operates the 
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electronic reset switches and allows each 
one of the 0 memories to learn the new 
value of the corresponding X memory. Then 
the values of the desired coefficients for 
the FI curve are available to produce 
curve FI through Newton’s Formula during 
the subsequent cycle from which the solu- 
tion F2 can be obtained. In a similar 
manner, solution F3 is obtained from F2, 
etc. 


In general, where three increments 
or four points are utilized in the Dif- 
ference Formula, a third degree approxi- 
mation of the desired function can be 
derived. if more accurate approximations 
are required, fourth, fifth, sixth, and 
higher degree polynomials can be evalu- 
ated by the inclusion of more memories 
and higher order differences. 

When continuous memory as previously 
discussed is incorporated in a calcula- 
tion, the parameter for the curves Fg, Fj, 
F 2, F^ and F^ may be a time increment 

which leads directly to transient problem 
solution. The increment might also be a 
theoretical stage in a stagewise operation 
and in this manner difference-differential 
equations can be solved. Many other appli- 
cations can be visualized for continuous 
memory. 


these differentiations are relatively 
noise free. In this manner, many 
problems which have not been practical 
heretofore because they required dif- 
ferentiation can now be successfully 
analyzed with an analog computer. 


It may be concluded that the DYSTAC 
Computer has provided rapid economic 
solutions to problems which have hereto- 
fore been so tedious or expensive to 
resolve that either gross approximations 
have been incorporated into the solution 
or else the problem has been neglected. 


OTHER APPLICATIONS 

Although the details will not be 
discussed at the present time, an all 
electronic transport delay can be de- 
veloped from the DYSTAC memories. This 
delay uses memories in a bucket brigade 
circuit with any desired accuracy of 
interpolation based on Newton's Divided 
Difference Formula or any other inter- 
polation technique that is desired. Any 
degree of accuracy may be obtained with 
this transport delay by balancing the 
number of sample points and the inter- 
polation. Its accuracy is far in excess 
of that available from the Fade 
approximat ion. 


The infinitesimal delays which 
can be incorporated into X and 0 memories 
can also be employed to differentiate and 


(022460 s) 
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RECOGNITION OF SLOPPY, HAND-PRINTED CHARACTERS* 

Worthie Doyle 

Staff Member 
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Lexington, Massachusetts 


Summary 


This report describes a pattern recognition 
scheme particularly intended to handle noisy and 
highly distorted data. The sample is subjected to a 
set of tests at the conclusion of which a single deci- 
sion is made. The decision is made on the basis of 
experience obtained from prior processing of label- 
led samples. The method has been applied to hand- 
printed English capitals but is evidently general. 
Results are given for some trials made on the IBM 
709. 

1. Introduction 


In any recognition scheme the raw data are 
transformed or tested and from the results decisions 
are made. Usually^' ^ the process consists of a 
sequence of tests together with rules for branching 
after each test, the final branch furnishing the de- 
cision. Any recognition method employing such a 
sequence of decisions to reach an ultimate verdict 
is evidently limited by the weakness of its worst 
tests. Tests which one might off hand think infal - 
lible are easily beaten by malevolent nature. For 
example, in the course of devising tests for charac- 
ter recognition a good datum to consider seemed to 
be the maximum number of intersections of the char- 
acter by a horizontal straight line. Most M' s were 
expected to have four; of the 46 samples at hand 11 
did. Our noisy data also included two each of R and 
T and one each of A and N having four. Unless the 
tests are good indeed or the samples to be recogniz- 
ed quite free of noise and distortion a parallel pro- 
cessing technique seems preferable. 

With parallel processing all tests are made 
before any decision is undertaken. -Though the in- 
dividual tests may be poor a reliable decision can 
be available provided there are enough tests, each 
contributing some different fractional bit of informa- 
tion. Such a set of tests may be considered a code 


The work reported in this paper was performed by 
Lincoln Laboratory, a center for research operated 
by Massachusetts institute of Technology with the 
joint support of the U.S. Army, Navy and Air Force. 


with redundancy, an appropriate counter to noise 
and distortion. The method here used to combine 
the test results resembles correlation detection 
of electrical signals and is an elementary realiza- 
tion of Pandemonium^), 

Another point sometimes overlooked in ap- 
plying tests to data from nature, as opposed to 
ideal representatives, is that the results, while 
still significant, are not what consideration of the 
ideal representatives might have led one to expect. 
Thus the action to be taken following a particular 
test should be determined by observing the results 
of testing real known samples from the population 
of characters one eventually hopes to be able to 
recognize rather than by applying preconceived 
rules. Such a " learning" process seems to be 
present in only two schemes so far reported(4> 5) 
Some preconception is unavoidable but is here 
present only in the choice of the data reduction 
mechanism and not in the decision making process 
itself. 

The scheme to be described incorporates 
both these notions -- decision reserved until all 
test results are in, and discrimination based on 
" learning " from real data. 

2. General Description of Character 
Recognition Scheme 


This section describes what has been pro- 
grammed and tried. Some reasons for these choices 
and against certain others are given in section 5, 
where some planned improvements are also noted. 

The present organization is depicted in fig- 
ure 1. The box " Process " first does .some minor 
filling and deleting on the original pattern (details 
in Appendix A), hopefully a noise -reduction me as - 
ure , and then produces a sequence of symbols 
representing the outcomes of various tests* applied 


The tests used for trials on hand-printed English 
are listed in Appendix B. 
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to each sample. For example, one already mention- 
ed test on characters produces as outcome the max- 
imum number of intersections of the character by a 
horizontal straight line. Thus for most O' s this test 
produces the symbol 2. The results of this data re- 
duction are collected in a table, the first census, 
which contains a tally of the number of occurrences 
of each test outcome for each pattern. A section of 
this census corresponding to application of the test 
just mentioned to ten characters looks like figure 2. 
(This census is for about half the population). In the 
learning phase such data are collected for each test 
and are organized in the hierarchy - Test, Charac- 
ter, Outcome, Number. After all the labelled sam- 
ples have been processed the census is re-organiz- 
ed in the order - Test, Outcome, Character, Num- 
ber; finally the observed incidences of outcomes are 
combined with assumed a priori probabilities of oc- 
currence for each character to calculate the inverse 
probabilities for each character given test and out- 
come. Effectively these probabilities form the block 
labelled " Census " . 

In the working phase the block " Process " is 
identical to the corresponding one in the learning 
phase and does exactly the same data reduction job. 

" Compare with experience " amounts to using the 
observed outcomes of the tests on unknowns to look 
up and extract pertinent portions of the second cen- 
sus for use by the next block. 

The block " Decide " forms weighted aver- 
ages of the inverse probabilities corresponding to 
the observed outcomes from the separate tests and 
indicates which character corresponds to the highest 
of these averages of a posteriori probabilities. This 
is our guess . 

3. Hand-Printed English (Problem) 

The particular restrictions which distinguish 
the example tried were chiefly determined by cost 
and convenience. Herb Sherman, for work reported 
in (1), had already collected several hundred sam- 
ples by having visitors to Lincoln Lab print their 
names and addresses in a series of quarter inch 
squares. Some of these samples had been quantized 
in a 32 x 32 array of bits and these characters, to- 
talling 329 samples, were kindly supplied on IBM 
cards by Mrs. Betty Howell. Later a further 600 
were made available on flexowriter tape prepared on 
TX-2 by Phil Peterson. * Since the trials were to be 
made on the IBM 709 this 32 x 32 array was augment- 
ed by a double row of zeros all around, it being con- 
venient to store the raw bit patterns as 36 consecu- 
tive machine words. This choice of format means 
that fine detail will be lost, which in turn imposes a 
restriction on the size of characters to be processed 
-- they should not be so small that quantizing into a 

The messy job of providing these last with identi- 
fying labels was cheerfully done by Martha Evens 
and Connie McElwain, while Barbara Lucy progrm- 
med the transformation from tape format to 36 word 
bit-matrices . 


32 x 32 array destroys essential information. 

Aside from this, distortion and some tilt are allow- 
ed as well as noise in the form of stray bits and 
holes. The characters may be anywhere in the 
field. To appreciate the nature of the data used 
consult figures 3 and 4 which reproduce most of 
the A' s and R' s involved in our trials. 

Finally because of the general paucity of 
data only the letters AEILMNORS T were 
used for the trials, those letters accounting for 
about two thirds of the total samples. This cir- 
cumstance makes recognition easier, of course, 
so that fewer and cruder tests could be used in 
the trials. Very little thought was given to the 
exact nature of the tests except that they were re- 
quired to be easily programmed and fast running, 
a supposed advantage of Pandemonium being its 
ability to get along with relatively poor tests. 

4. Hand-Printed English (Results-) 

For a trial of the scheme on the 10 char- 
acters the stock was shuffled and split into two 
parts, each of which contained about half of the 
samples for each character. One half of the sam- 
ples were fed to tne learning phase and the other 
half used as unknowns. The roles of the two halves 
were then interchanged for a second trial, so that 
each half was recognized using the experience de- 
rived from testing the other half. The results of 
such a pair of trials appear in figure 5. The diag- 
onal terms are observed frequencies of correct 
recognition, while an off-diagonal term is the fre- 
quency with which a character labelled as at left 
was incorrectly guessed to be the character at 
top. The "fraction correct" is the average of the 
diagonals . 

In addition, several sets of weights were 
tried for the tests used, including omission of sets 
of similar tests. With some omissions the error 
rate increased as much as 5 % , though generally 
the process seems insensitive to the adjustment 
of test weights, indicating a fair degree of inform- 
ation overlap. 

5. Discussion 

Most of the visible defects of the process 
just described are due to the manner of laying up 
the experience derived during the learning phase 
and of referring to it when identifying an unknown. 
Ideally, when testing a sample the results of all 
the separate tests should be retained in combina- 
tion and the experience stored as if there were a 
single grand test whose outcome is a vector com- 
posed of the outcomes of the several elementary 
tests. However, such a procedure would require 
a considerably greater amount of storage space for 
experience if any reasonably large sample of a 


Unpublished note of Les Wilson. 
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population were presented for learning. This con- 
sideration was responsible for the original decision 
to throw away the information provided by the joint 
occurrence of test results. If it were possible to 
impose a measure of sameness on the space of test 
results then the storage capacity required for ex- 
perience would be vastly reduced. Without some 
means of deciding degree of likeness between liter- 
ally different test outcomes it is necessary to store 
a complete test outcome vector for each different 
combination of outcomes that occurs. The proce- 
dure described in section 2 stores the separate out- 
comes in isolation, so that storage requirements 
grow proportionately to the number of outcomes for 
each of t tests rather than as the t-th power of the 
number of outcomes. 

Some intelligent compromises should per- 
mit retaining most of the information inherent in 
the joint outcomes without undue increase in storage 
requirements and two such compromises are going 
to be tested. 

First of all, it is possible simply to combine 
a small number of the original tests into a single 
larger test whose outcome is the sequence of sym- 
bols resulting from mere stringing together, inpre- 
assigned positions, of the outcome symbols from 
the separate tests. If the groups thus combined con- 
sist of blocks of tests which, as blocks, are roughly 
independent of each other then most of the informa- 
tion of a single grand test will still be retained. This 
approach is almost ready for trial with one reason- 
able grouping of our present 28 tests into 10 com - 
binations and will be reported on later. 

Second, certain of the tests, for example 
those for cavities, may be so designed as to have a 
natural measure of distance between results. For 
these the experience may be stored in a more com- 
pact form and a mixed look-up procedure used in 
the block " compare with experience " . This would 
produce not only savings in data storage, but also 
an ability to make decisions even when test results 
on unknowns differ from all the results obtained on 
the labelled samples during the learning phase. 

Appendix A 

The Filtered Image in Storage 

The bit matrix originally presented is slight- 
ly modified to fill small holes and eliminate isolated 
bits and pieces. Both the filling and elimination rules 
used are rudimentary and depend upon examination 
of the 3x3 array whose center bit is under consid- 
eration. Rules are given in figure 6, the deletion 
rule being strictly from Unger(^). The fill rule has 
been applied first since our data appeared to suffer 
more from gaps than extra spots. 

The modified image is then supplemented by 
three rotated replicas, each of 36 machine words, 
which represent views from the other three sides. 
Thus most of the basic tests yield four separate pieces 
of information (sometimes two). It is these modified 
images which are actually tested. They are referred 
to as the North, West, South, and East views in ap- 
pendix B, where the tests used are listed. 


Appendix B 


Tests Used in Trial 

Each test was given a six character name 
and the tests are here listed under these names. 

The outcome of each test is a string of at most 
12 digits from the range 0 to 7 inclusive. There 
is no necessity for these choices; they happen to 
be convenient when trials are to be made on a 
general purpose binary calculating machine like 
the IBM 709. The description of each test states 
the rule for determining this sequence of symbols, 
starting from the filled and deleted bit pattern. To 
help this description a typical character is repro- 
duced in figure 7. The result given under each 
test is for this A. 

RHCSEQ 1212 

The sequence composed of the numbers of 
intersections of the character by a horizontal line, 
scanning from top to bottom, after adjacent rep- 
etitions have been eliminated. 

RYCSEQ 1212321 

As above, but going cross -wise. 

VERUNS 1212 

The sequence composed of the relative 
lengths of the runs of each section of RHCSEQ. 

Let H, W be height and width of the character and 
[n] denote greatest integer not exceeding n. Then 
each digit is given by [ 4 x RUN/H] + 1. 

HORUNS 1111122 

As above, but for the runs leading toRVCSEQ. 
HOMXSC 2 

The biggest digit in RHCSEQ. 

VEMXSC 3 

The biggest digit in RVCSEQ. 

VERSTR 2 HORSTR 2 

Take the number of blocks of bits in each 
horizontal (vertical) slice of the array. Add these 
numbers and normalize by height (width). 

The next block of tests contains four of each 
type, corresponding to views from north, east, 
south, and west. The results from the sample are 
given in that order, following the test’ s rootname. 

BOTSG 12 2 1 

That number of intersections which first 
occurs for a significant (now l/5 H or W) number 
of slices as one moves in from the side inquestion. 

BEN 11 10 11 21 

Work in from a side until you encounter a 
slice with more than one block of ones. Starting 
with this slice, OR together with it each succeed- 
ing slice. After each new slice is OR-ed in count 
the number of blocks of ones. When the count de- 
creases stop. Output of the test is a pair of digits 
A, B. The first. A, is the last count of blocks. 

The second, B, is 4 times the number of steps be- 
fore stopping divided by the height or width, which- 
ever is appropriate. 

EDGE 1313 

Moving in from an edge, the first three non- 
empty slices are OR-ed together and the number 
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of bits in the rightmost block of ones of this word 
is compared with the height or width. Output is 
[ 4 x BITS/H (or W) + . 5] . 

BOT 121 220 220 121 

Work in from a side and at each slice 1) 
count the number of blocks of ones in this slice, 

2) OR this slice into a word consisting of the union 
of all previously processed slices and 3) keep count 
of the number of slices processed. When the re- 
sult of 1) exceeds 1 (or you run out of slices) stop. 
Output of this test is three digits A, B, C: 

A. count of the number of blocks of ones 

in the union of slices 

B. number of blocks of ones in the slice 

where you stopped 

C. the count in 3) normalized to height or 

width: 

[ 4 x COUNT/H (or W ) + . 5 ] . 


CAY 112 1 

Work in from a side and form union of first 
non-empty slice and its successor. Then alternate- 
ly adjoin to this union successive slices and count 
the blocks of ones in the union. Stop when this block 
count decreases or you are halfway through the char- 
acter. Output is the penultimate block count. 


The following two tests form useful combin- 
ations of theEDGE test results, recovering some 
previously discarded information. 

NO/SOU 3 

Form the ratio of the outputs of NEDGE and 
SEDGE. Consult a list of thresholds to derive a 
digit from 1 to 5 as test output. 5 means " verybig", 
3 " about the same " and so on. 

EA/WES 3 

Same as above, but for EEDGE and WEDGE. 


(4) Grimsdale et al. , " A System for the Auto- 

matic Recognition of Patterns " , Proc. 
I.E.E., paper No. 2792M, Dec. 1958, 

p. 210. 

(5) W.W. Bledsoe and I. Browning, "Pattern 

Recognition and Reading by Machine ", 
Eastern Joint Computer Conference, 
Dec. 1959. 
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Figure 1 

ORGANIZATION OF PATTERN RECOGNITION SCHEME 
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Figure 2 

PORTION OF FIRST CENSUS 
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CONFUSION MATRICES (ABOUT ".875 RIGHT" AND ABOUT ".872 RIGHT") 



a b c 


d e f 


g h i 


Fill: e’ = e + a* i + b. h + c • g + d* f 

Delete: e' =e • (b + c + f)* (d + g + h) + e • (d + a + b)- (f + i + h) 


Figure 6 

ROLES USED FOR IMAGE FILTERING 
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SAMPLE CHARACTER WITH TEST OUTCOMES 
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EMPIRICAL EXPLORATIONS OF THE GEOMETRY THEOREM MACHINE * 

H. Gelernter, J. R. Hansen, and D. W. Loveland 
IBM Research Center 
Yorktown Heights, New York 


Introduction 

In early spring, 1959, an IBM 704 com- 
puter, with the assistance of a program com- 
prising some 20 000 individual instructions, 
proved its first theorem in elementary Eucli- 
dean plane geometry 1 . Since that time, the 
geometry theorem-proving machine (a partic- 
ular state configuration of the IBM 704 spec- 
ified by the afore -mentioned machine code) 
has found solutions to a large number of prob- 
lems^ taken from high school textbooks and 
final examinations in plane geometry. Some of 
these problems would be considered quite diffi- 
cult by the average high school student. In 
fact, it is doubtful whether any but the brightest 
students could have produced a solution for any 
of the latter group when granted the same 
amount of prior "training" afforded the geom- 
etry machine (i.e. , the same vocabulary of 
geometric concepts and the same stock of pre- 
viously proved theorems). 

The research project which had as its 
consequence the geometry theorem -proving 
machine was motivated by the desire to learn 
ways to use modern high-speed digital compu- 
ters for the solution of a new and difficult class 
of problems; a class heretofore considered to 
be beyond the capabilities of a finite state auto- 
maton. In particular, we wished to make our 
computer perform tasks which are generally 
considered to require the intervention of human 
intelligence and ingenuity for their successful 
completion". The reasons behind our choice of 
theorem-proving in geometry as a representa- 
tive task are set forth in detail in an earlier 
paper^. We only remark here that problem- 
solving in geometry satisfies our definition of 
an intellectual activity, while being at the same 
time especially well suited to the approach we 
wished to explore. The fact that geometry is 
decideable is irrelevant for the purpose of our 
investigation. The methods employed by the 
machine are suitable as well for the proof of 
theorems in systems for which no decision al- 
gorithm can exist. 


#This report is a summary of the full paper to 
be submitted for publication in an appropriate 
journal. 


We shall not labor the question as to 
whether our machine is indeed behaving intelli- 
gently in performing a task for which humans 
are credited with intelligence. The psycholo- 
gists offer us neither aid nor comfort here; they 
have yet to satisfactorily characterize such be- 
havior in humans, and have rarely considered 
the abstract concept of intelligence independent 
of its agent. In the final analysis, people are 
occasionally observed to do things that may best 
be described as intelligent, however vague the 
connotations of the word. These are, in general, 
tasks involving highly complex decision proces- 
ses in a potentially infinite and uncontrollable 
environment. We should be most happy to have 
our machine duplicate this kind of behavior, 
whatever label is affixed to it. 

Heuristic Programming and the Geometry Machine 

The geometry machine is able to discover 
proofs for a significant number of interesting 
theorems within the domain of its ad hoc formal 
system (comprising theorems on parallel lines, 
congruence, and equality and inequality of seg- 
ments and angles) without resorting to a decision 
algorithm or exhaustive enumeration of possible 
proof sequences. Instead, the theorem-proving 
program relies upon heuristic methods to re- 
strain it from generating proof sequences that 
do not have a high a priori probability of leading 
to a proof for the theorem in question. 

The general problem of heuristic pro- 
gramming has been discussed by Minsky^ and 
Newell, Shaw, and Simon^ . The particular ap- 
proach pursued by the authors has been des- 
cribed at length in the papers to which we have 
already referred ^ . We shall therefore defer 
to the presentation of the machine's detailed re- 
sults in the full paper summarized here for a 
description of how these results were achieved. 

It should be recorded here, however, that the 
geometry machine operates principally in the 
analytic mode (reasoning backwards ). At each 
stage of the search for a proof, a goal exists 
which must be "connected" with the premises for 
the problem by a bridge of axioms and previously 
established theorems or lemmas. If the connec- 
tion cannot be made directly, then a set of "sub- 
goals" is generated and the process is repeated 
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for one of the subgoals. Heuristic rules are 
used to reject subgoals that are not likely to 
prove useful, to select one from those remain- 
ing to work on, and to choose particular axioms 
and theorems to use in generating new subgoals. 
The machine does depart frop^ this procedure 
in a number of circumstances (in setting up an 
indirect proof, for example), but these cases 
account for only a small fraction of the total 
search time. 

The computer program itself was writ- 
ten within the framework of the so-called 
Newell -Shaw -Simon list memory^. In order to 
ease the task of writing so massive and complex 
a machine code, a convenient special-purpose 
list-processing language was designed to be 
compiled by the already available^FORTRAN 
system for the IBM 704 computer . The au- 
thors feel that had they not made free use of an 
intermediate programming language, it is like- 
ly that the geometry program could not have 
been completed. 


Summary of Results 


Since its initial solo perfbrmance, the 
geometry machine has existed in several dif- 
ferent configurations. In its earliest and most 
primitive form, the system was equipped with 
a single major semantic heuristic 0 . That first 
system was, however, able to prove a large 
number of interesting, though admittedly simple 
theorems in elementary plane geometry''. The 
heuristic rule in question, which is independent 
of the particular formal system under consider- 
ation, may be described in the following way. 

All subgoal formulae that are generated at a 
given stage of the proof-search are interpreted 
in a model of the formal system; in our case, 
the model is a diagram, a formal semantic in- 
terpretation. If the interpreted subgoal is valid 
in the diagram, it is accepted as a possible step 
in the proof, provided that it is non-circular^. 
Otherwise, it is rejected. 

As an experiment, a number of attem- 
pts were made to prove extremely simple 
theorems with the latter heuristic "disconnec- 
ted" from the system (i.e. , all non-circular 
subgoals generated were accepted). In each 
case, the computer's entire stock of available 
storage space was quickly exhausted by the 
initial several hundreds of first level subgoals 
generated, and, in fact, the machine never 
finished generating a complete set of first 
level subgoals. We estimate conservatively 
that on the average, a number of the order of 
1 000 subgoals are generated per stage by the 
decoupled system. If one compares the latter 
figure with the average of 5 subgoals per stage 


accepted when the diagram is consulted by the 
machine, it is easy to see that the use of a dia- 
gram is crucial for our system. (Note that the 
total number of subgoals appearing on the prob- 
lem-solving graph grows exponentially with the 
number accepted per stage.) 

Since the procedure described above is 
a heuristic one, errors are occasionally made 
in the selection or rejection of formulae as sub- 
goals. The diagram is made available to the 
machine in coordinate representation to finite 
precision. Formulae are interpreted by trans- 
forming them into an appropriate calculation on 
the numerical coordinates representing the point 
variables. For example, to check the validity 
of a statement concerning the equality of two 
segments, the length of each segment in the 
figure is calculated, and they are then compared 
to a certain preassigned number of decimal 
places. If, instead, the statement concerned 
parallel segments, the slopes would be calcula- 
ted and compared. In a small number of cases, 
roundoff error has propagated beyond the al- 
lowed value, so that valid subgoals were reject- 
ed, or invalid ones accepted. It is important to 
point out, however, that in no case could this 
effect result in a false proof. Where valid sub- 
goals were rejected, the machine found alter- 
nate paths to the solution. Where invalid ones 
were accepted, the machine failed, of course, 
to establish them within the formal system. In 
the worse possible case, the interpretation error 
could prevent the computer from finding any 
solution at all, but never could it lead to an in- 
valid proof. 

It should be clear at this point that the 
diagram is used only to guide the search for a 
proof by supplying yes or no answers to ques- 
tions of the form: "Is segment AB equal to seg- 
ment CD in the figure? ", or "Is angle ABC a 
right angle in the figure? ". There is no direct 
link between the diagram and the formal system 
in the geometry machine. The behavior of the 
machine would not be changed if the coordinate 
representation were replaced by a device that 
could draw figures on paper and scan them. 

In the basic theorem-proving system 
described above, after a set of subgoals has 
been generated, each member of the set is ex- 
plored in order. The next subgoal in line is not 
examined until the one preceding it has been 
followed down to a dead end. Too, in generating 
the next level for a given subgoal, every applic- 
able theorem available is pressed into service. 

This system was soon extended by the 
introduction of selection heuristics for both sub- 
goals and subgoal-generating theorems. The 
subgoal selection heuristic assigns a "distance" 
between each subgoal string and the set of 
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premises in a vaguely-defined ad hoc formula 
space. At each stage, the next subgoal selec- 
ted is that which is "closest" to the premises 
in formula space. The generator selection 
routine recognizes certain classes of subgoals 
that are usually established in one step. For 
such "urgent" subgoals, the appropriate gener- 
ator is withdrawn immediately, and an attempt 
is made for a one -step proof (of that particular 
subgoal) before generating the full set for that 
formula. 

The extended system is able to prove a 
number of somewhat more difficult theorems 
that are beyond the capacity of the basic 
machine^ . For those problems within the 
range of both systems, the former is, on the 
average, about three times faster, and gener- 
ates about two -thirds the total number of sub- 
goals in half as many subgoal generation cycles 
as required by the basic system. The average 
depth of the problem-solving graph for the re- 
fined system, about seven to nine levels, is 
two-thirds the average depth for the basic sys- 
tem. 

By the addition of a simple construction 
routine, the theorem-proving power of the 
machine is expanded to include an entirely new 
class of problem, hitherto logically unattain- 
able. The routine, called upon only when all 
other attempts have failed, allows the machine 
to join two previously unconnected points in the 
diagram, and extends the newly-created seg- 
ment to its intersections with all other segments 
in the figure. The new segment, when it inter- 
sects previously given ones, introduces new 
points into the problem which are named by the 
machine and become part of the problem system. 

At this stage in its development, the 
geometry machine was capable of producing 
proofs that were quite impressive (Appendix I). 
Its performance, however, fell off rapidly as 
the number of points in the diagram increased. 
This effect was due largely to the fact that un- 
like humans, who generally identify angles vis- 
ually by their vertices and rays, the computer 
specifies an angle by a predicate on three vari- 
ables, the vertex and a point on each ray. Con- 
sequently, the equality of angles 1 and 2 in 


figure 1 below may be represented in thirty- 
six different ways, since each angle has six 
different names. Formal rigor demands, too, 
that the equality of angles ADH and EDG, for 
example, be proved rather than taken for 
granted. It should be clear that where the con- 
dition above exists, the search for a proof 
quickly bogs down in a mass of uninteresting 
detail. 



Figure 1 

In the current system, the angle prob- 
lem is solved by allowing the machine to use 
the diagram to identify a given angle with its 
full set of names, and to assume the equality 
relationship between different names for the 
same angle, as does its human counterpart. 

The geometry machine in its present configur- 
ation is able to find proofs for theorems of the 
order of difficulty represented by the following: 

THEOREM: If the segment join- 
ing the midpoints of the diagonals 
of a trapezoid is extended to in- 
tersect a side of the trapezoid, it 
bisects that side (Appendix II). 

Limitations of the System 

It will be immediately evident to those 
familiar with the properties of formal logistic 
systems that unless a construction which gen- 
erates a new point is introduced by the machine, 
all problems are solved within the framework 


* In the proofs appended to this paper, the non-obvious predicates have the following interpretations: 


OPP-SIDE XYUV 
SAME -SIDE XYUV 
PRECEDES XYZ 
COLLINEAR XYZ 


Points X and Y are on opposite sides of the line through points U and Y. 
Points X and Y are on the same side of the line through points U and V. 
Points, X, Y, and Z are collinear in that order 
Points X, Y, and Z are collinear. 
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of a propositional calculus, however complex 
its structure. Although the machine's present 
construction routine can and does generate new 
points, we could not expect our results to be of 
great interest to logicians until a full set of 
possible constructions (corresponding to a com- 
plete set of existentially quantified axioms) is 
made available to the system to abet its search 
for a proof. 

An equally serious limitation on the 
formal generality of the theorem-proving 
machine is imposed by our method for determ- 
ining the well-formedness of strings within the 
logical system. In order to attain the neces- 
sary speed and efficiency in processing, well- 
formed formulae are defined by schema rather 
than recursively. The kind of statement that 
can be made in the system is then determined 
by the schema available to the machine. The 
practical effect of this loss in generality is to 
restrict rather severely the freedom with which 
algebraic statements in geometry may be man- 
ipulated. 

In addition to the above, there are a num- 
ber of non-essential bounds on the theorem- 
proving ability of the machine. These are a 
consequence of the limited speed and memory 
capacity of the computer for problems of such 
highly combinatorial character. Improvements 
in either of the above will be immediately effec- 
tive in extending the class of machine -solvable 
problems in both quantity and difficulty. 

Conclusion 


The initial goal of our research program 
in machine intelligence has been attained. If 
the interrogator were to restrict his probing to 
the area of theorem-proving in elementary Euc- 
lidean plane geometry, our machine could be 
expected to give an excellent account of itself in 
competition with a human in Turing's well-known 
"imitation game" ^ . Of course there are many 
other problem areas (solving arithmetic prob- 
lems, for example) where computers have al- 
ways been able to compete successfully with 
humans. The significant point is that a know- 
ledgeable interrogator would certainly avoid 
such areas in his questioning, while he might 
well (until now, at any rate) introduce a plane 
geometry problem in a calculated attempt to 
separate the men from the machines 1 -^ . Al- 
though the stage is now set for the argument that 
any distinct area of human intellectual activity 
will in the same way succumb to the inexor- 
able logic of electrons, switches, and gates, 
we defer to our philosopher colleagues for de- 
bate on the implications of that contention, at 


least until the time that computers have been 
programmed to consider such issues. 

There are a number of consequences 
of our work that are, fortunately, more con- 
crete than that alluded to above. Perhaps the 
most important are those relating to inferential 
analysis, a new branch of applied logic first 
characterized by Wang ^ . Inferential analysis 
"treats proofs as numerical analysis does cal- 
culations", and is expected to "lead to mechan- 
ical checks of new mathematical results" and, 
more important, "lead to proofs of difficult new 
theorems by machine". It is expected that our 
techniques for the manipulation and efficient 
search of problem solving trees^nd our results 
concerning syntactic symmetry will prove to 
be useful tools in pursuing the goals of inferen- 
tial analysis. 

Contributions have been made, too, in 
the area of techniques for computer implemen- 
tation of complex information processes. Re- 
sults pertaining to the design and use of inter- 
mediate languages for the specification of list 
manipulation processes have been reported else- 
where^. The latter work indicates clearly the 
requirements of a digital computer system de- 
signed for optimum execution of such list proces- 
ses. In brief, a list-processing computer 
should possess hardware facilities for: 

1) Generalized indirect addre s - 
sing; specified in the indirectly 
addressed instruction to arbitrary 
depth and in arbitrary order from 
either the left or the right field of 
a two-address register, 

2) Effective address recovery; 
making available the terminal 
content of the address register 
(the final address in a long and 
complex indirect address chain, 
for example) as the address field 
for a subsequent operation, 

3) Field logic; a greatly expan- 
ded set of interfield operations 
within a full register sectioned 
according to some previously es- 
tablished convention, and 

4) List search operations; a list 
equivalent of the conventional 
table look-up instruction. 

The bulk storage input-output requirements for 
a list-processing computer are severe, and are 
not included in the enumeration above. The sys- 
tem design of a digital computer for the manip- 
ulation of list structures will be described in 
detail in a subsequent paper. 
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Finally, we consider the implications 
of our work for the basic problem of machine 
intelligence. The geometry machine, we feel, 
offers convincing evidence of the power and 
fruitfulness of heuristic programming for the 
solution of problems of a certain class by com- 
puter. In our experience, the theorem-proving 
power of the machine has often been extended by 
the addition of a single heuristic to a degree 
equivalent to a three to fivefold increase in the 
speed or storage capacity of the computer. 

Our program has proved to be disap- 
pointing as a tool for the study of the more 
elementary trial-and-error types of machine 
learning, largely because of the rather low rate 
at which it accumulates experience. It is 
reasonable to expect, however, that the geom- 
etry machine might yet be pressed into service 
in an investigation of the higher, conceptual 
types of machine learning, providing that one 
will someday know how to formulate the prob- 
lem. 

If nothing else, our work offers some 
qualitative indication of the order of magnitude 
of difficulty for problems that could be expec- 
ted to yield to contemporary computer tech- 
nology. Three years ago, the dominant opin- 
ion was that the geometry machine would not 
exist today. And today, hardly an expert will 
contest the assertion that machines will be 
proving interesting theorems in number theory 
three years hence. 
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Appendix I 


PREMISES 

************** 

QUAD-LATERAL ABCD 
POINT E MIDPOINT SEGMENT AB 
POINT F MIDPOINT SEGMENT AC 
POINT G MIDPOINT SEGMENT CD 
POINT H MIDPOINT SEGMENT BD 

TO PROVE 
************** 

PARALELOGRAM EFGH 


SYNTACTIC SYMMETRIES 
************************** 

BA* AB* DC* CD* EE. HF, GG. FH ♦ 

CA* DB* AC* BD* GE. FF » EG* HH , 

DA. CB. BC. AD. GE. HF , EG* FH , 

PROOF 

*********** 


c 



A D 


SEGMENT DG EQUALS SEGMENT GC 
DEFINITION OF MIDPOINT 
SEGMENT CF EQUALS SEGMENT FA 
DEFINITION OF MIDPOINT 
TRIANGLE DCA 

ASSUMPTION BASED ON DIAGRAM 
PRECEDES DGC 

DEFINITION OF MIDPOINT 
PRECEDES CFA 

DEFINITION OF MIDPOINT 
SEGMENT GF PARALLEL SEGMENT AD 

SEGMENT JOINING MIDPOINTS OF SIDES OF TRIANGLE IS PARALLEL TO BASE 
SEGMENT HE PARALLEL SEGMENT AD 
SYNTACTIC CONJUGATE 
SEGMENT GF PARALLEL SEGMENT EH 

SEGMENTS PARALLEL TO THE SAME SEGMENT ARE PARALLEL 
SEGMENT HG PARALLEL SEGMENT FE 
SYNTACTIC CONJUGATE 
QUAD-LATERAL HGFE 

ASSUMPTION BASED ON DIAGRAM 
PARALELOGRAM EFGH 

QUADRILATERAL WITH OPPOSITE SIDES PARALLEL IS A PARALLELOGRAM 


TOTAL ELAPSED TIME 1*03 MINUTES 


Appendix II 


PREMISES 

************** 

QUAD-LATERAL ABCD 

SEGMENT BC PARALLEL SEGMENT AD 

POINT E MIDPOINT SEGMENT AC 

POINT F MIDPOINT SEGMENT BD 

PRECEDES MEF 

PRECEDES AMB 

TO PROVE 
************** 

SEGMENT MB EQUALS SEGMENT MA 

NO SYNTACTIC SYMMETRIES 
***************************** 


B C 



A K D 


AM STUCK, ELAPSED TIME 8.12 MINUTE! 

"SEGMENT cf 

EXTEND SEGMENT CF TO INTERSECT SEGMENT AD IN POINT K 


ADD TO PREMISES THE FOLLOWING STATEMENTS 
**************************************** 


PRECEDES CFK 
COLL I NEAR AKD 


PROOF 

*********** 

SEGMENT BC PARALLEL SEGMENT AD 
PREMISE 
COLL I NEAR AKD 
PREMISE 

SEGMENT KD PARALLEL SEGMENT BC 

SEGMENTS COLLI NEAR WITH PARALLEL SEGMENTS ARE PARALLEL 
OPP-SIDE KCDB 

ASSUMPTION BASED ON DIAGRAM 
SEGMENT DB 

ASSUMPTION BASED ON DIAGRAM 
ANGLE KDB EQUALS ANGLE CBD 

ALTERNATE-INTERIOR ANGLES OF PARALLEL LINES ARE EQUAL 
PRECEDES CFK 
PREMISE 
PRECEDES DFB 

DEFINITION OF MIDPOINT 
ANGLE KFD EQUALS ANGLE CFB 

VERTICAL ANGLES ARE EQUAL 
SEGMENT DF EQUALS SEGMENT FB 
DEFINITION OF MIDPOINT 
TRIANGLE FDK 

ASSUMPTION BASED ON DIAGRAM 
TRIANGLE F8C 

ASSUMPTION BASED ON DIAGRAM 
TRIANGLE FDK CONGRUENT TRIANGLE FBC 

TWO TRIANGLES ARE CONGRUENT IF ANGLE-SIDE-ANGLE EQUALS ANGLE-SIDE-ANGLE 
SEGMENT KF EQUALS SEGMENT CF 

CORRESPONDING SEGMENTS OF CONGRUENT TRIANGLES ARE EQUAL 
SEGMENT CE EQUALS SEGMENT EA 
DEFINITION OF MIDPOINT 
TRIANGLE AKC 

ASSUMPTION BASED ON DIAGRAM 
PRECEDES CEA 

DEFINITION OF MIDPOINT 
SEGMENT EF PARALLEL SEGMENT AK 

SEGMENT JOINING MIDPOINTS OF SIDES OF TRIANGLE IS PARALLEL TO BASE 
SEGMENT EF PARALLEL SEGMENT KD 

SEGMENTS COLL I NEAR WITH PARALLEL SEGMENTS ARE PARALLEL 
SEGMENT FE PARALLEL SEGMENT BC 

SEGMENTS PARALLEL TO THE SAME SEGMENT ARE PARALLEL 
PRECEDES MEF 
PREMISE 
COLL I NEAR MEF 

ORDERED COLL INEAR POINTS ARE COLL I NEAR 
SEGMENT FM PARALLEL SEGMENT BC 

SEGMENTS COLLI NEAR WITH PARALLEL SEGMENTS ARE PARALLEL 
SEGMENT FM PARALLEL SEGMENT DA 

SEGMENTS PARALLEL TO THE SAME SEGMENT ARE PARALLEL 
TRIANGLE DBA 

ASSUMPTION BASED ON DIAGRAM 
PRECEDES AMB 
PREMISE 

SEGMENT MB EQUALS SEGMENT MA 

LINE PARALLEL TO BASE OF TRIANGLE BISECTING ONE SIDE BISECTS OTHER SIDE 


TOTAL ELAPSED TIME 30.68 MINUTES 
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A SUGGESTED MODEL FOR INFORMATION REPRESENTATION 
IN A COMPUTER THAT PERCEIVES, LEARNS, AND REASONS'* 

Peter H. Greene 

Committee on Mathematical Biology 
The University of Chicago 
Chicago, Illinois 


Summary * This paper investigates how compu- 
ters might represent enough of the structure of 
the percepts and concepts they handle so that they 
may sensibly be said to deal with the meaning of 
these things, rather than just to sort and recom- 
bine mere labels for the operator's percepts and 
concepts. One of the main requirements is that 
each element of information contain partial repre- 
sentations of many other elements and schemata for 
their interconnection. Some of these requirements 
may be met if it proves feasible to represent in- 
formation in the form of vectors (such as modes of 
oscillation of a complicated network or resonator) 
which may be resolved into components in various 
coordinate systems. These systems represent vari- 
ous points of view from which the information may 
be regarded, and some of the information in each 
System is elicited by a probabilistic mechanism 
for use by a conventional computer. 


This paper forms part of a program?- of inves- 
tigating mechanisms whereby computers can repre- 
sent enough of the structure of the percepts and 
concepts they handle so that they, rather than the 
human operator, can be said to deal with the 
meaning of these things. The earlier paper stated 
some of the main goals j this paper proposes a 
model which may achieve some of these goals. We 
saw^ that present-day computers deal primarily 
with external relations among concepts which are 
given in a form that does not represent their 
inner structure . Thus the concepts may be sorted 
and combined, but their meaning resides in the 
mind of the operator. We saw that an important 
task which precedes comparison and abstraction is 
the formation of impressions into representations 
adequate to sustain abstraction. Moreover, this 
primary challenge of forming impressions into 
logical elements is inseparably connected with the 
formation of a rule which gives the ordering and 
interdependence of the logical elements. Each 
element must contain partial representations of 
many other elements and schemata for their inter- 
connection. We explained that if we regard the 
concepts of the computer as copies of a definite 
world of facts, we are doing the computer's job 
of carving out significant units. The particular 
units constructed by the computer would depend up- 
on the generative principles of connection we have 
mentioned, and we were led to expect clusters of 
related thought precursors formed around indivi- 
dual generative principles built into the computer. 
We spoke of the need for analyses of meaningful 


units richer than their description in terms of 
conjunctions of atomic parts or their negations. 
Finally, we discussed problems involved in the 
important task of becoming acquainted with causal 
relations and the potentialities of things and 
actions — what would happen in situations not 
actually existing. The bibliography cited impor- 
tant studies of psychological behavior exemplify- 
ing all the preceding considerations. These 
studies, together with the philosophical studies 
there cited and with comments on the relevance of 
these ideas to computers, are reviewed in a pre- 
vious paper. ^ 

This paper proposes a new model for the 
representation of information in a computer which, 
if it proves feasible to realize, would lead to 
all the features of behavior that we have called 
for in the preceding summary, including certain 
structural properties of behavior resembling many 
of the psychological properties of perception, 
learning, and higher mental functions. A problem 
which wil.1 remain to be solved before such a 
model can become useful is to specify the precise 
way in which the properties will appear and get 
those situations to be just the ones we are in- 
terested in for practical reasons. Some of the 
motivation for part of the model was presented in 
an earlier paper by the author.^ The present 
paper explains the model itself in a more sys- 
tematic way and introduces features with numerous 
consequences not then known. Table 1 summarizes 
the various parts of the argument and indicates 
their interdependence. 

Initially the model was intended to simulate 
important features of Gestalt perception, with 
the -understanding that a more adequate theory 
could not arbitrarily separate perception and 
thinking. It was therefore very gratifying to 
discover that upon the introduction of a missing 
part of the model required for the understanding 
of Gestalt perception, the model without further 
extension predicted a type of behavior resembling 
theoretically inferred mechanisms of learning, 
thought processes, and certain integrated action 
patterns performed by animals. 

In a crude pattern recognizer capable of 
recognizing a class of patterns such as the front 
door lock of an apartment building, the discrimi- 
nating features, or "perceptual units," are 
rigidly built into the objects to be discrimi- 
nated. But we know that more sophisticated 
recognizers cannot be provided with such ready- 
made units, and it is our job to understand how 
the task preceding discrimination, namely the 
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formation of stable units to be recognized and dis- 
criminated, may be accomplished. A machine (or 
person) lacking this process would combine all 
sorts of stimuli into meaningless groups. The 
model presented here is intended for that part of 
a machine which produces the stable, meaningful 
units, and is thus intended not to replace but to 
supplement other types of perceiving or reasoning 
devices. 

Now let us state the kind of mechanisms which 
will be used, although only the subsequent discus- 
sion will reveal what connection they have with 
the problems that we have outlined. Information 
is to be coded as patterns of oscillation of com- 
plicated resonators or networks of oscillators. 

In particular, these patterns may be resolved into 
superpositions of normal modes of various kinds, 
and these normal modes serve as the symbols for 
certain stable percepts or concepts — e.g., for 
"good" Gestalten. The same pattern may be re- 
solved into superpositions of any one of a number 
of complete sets of normal modes. Such transfor- 
mations are carried out reversibly, and the modes 
of any such set will be thought of as representing 
possible outcomes of the elicitation of informa- 
tion regarding one particular aspect of the total 
pattern. The above processes are intended to 
simulate human processes which are normally never 
at the level of awareness. Elicitation of this 
information for responses and simulated percep- 
tion at the level of awareness will require the 
following mechanisms (for reasons which cannot be 
guessed before the explanations which follow) : 
First, the total pattern must be split into one of 
the possible sets of normal modes. Second, each 
such mode mast be multiplied and averaged with the 
same shot noise, and the mode thus yielding the 
largest result selected to produce the percept or 
response and then returned to the population of 
total patterns. Subject to slight elaboration 
later, these are all the mechanisms we shall need: 
coding as modes of oscillation, together with a 
mechanism for elicitation involving random selec- 
tion and recombination. For concreteness, I like 
to think of microwave modes in some sort of com- 
plicated resonator, but I do not know whether such 
a realization might be feasible. Even though no 
realization is known, it still seems worthwhile to 
derive unexpected psychological properties of 
mechanical processes. 

The argument will be presented by indicating 
the new perceptual features introduced as the 
properties of the model are incorporated, one by 
one. At each stage of the argument appropriate 
behavioral desiderata, chosen from the topics of 
the first paragraph, will be shown to motivate 
the incorporation of the next mathematical proper- 
ty of the model. Thus the specific mathematical 
model just outlined is derived from such topics, 
which will form the bulk of this paper, and must 
not be regarded merely as philosophical back- 
ground. Once these arguments are understood, and 
not before, the simulated psychological aspects 
of the behavior of the model may be pointed out 
rather simply. 

Since this paper will propose a new means of 
information representation in computers, it is 
important first to have for a guide an intuitive 


idea of the kind of thing which is being at- 
tempted. .We wish to build a machine that per- 
forms the necessary steps prior to discrimination 
and abstraction, and we are particularly inter- 
ested in the versatility displayed in the pro- 
cesses of pattern formation and stabilization. 
Perception automatically adjusts itself to varia- 
tions of the stimuli in a multitude of integrated 
ways, and it continually leads to the formation 
of new meaningful perceptual elements. Thus we 
seek a model which offers the possibility for 
such flexibility. Heuristically speaking, we 
shall not seek numerous models in each of which 
some class of response patterns is "built in," 
but rather a model which may be conceived of in- 
tuitively as a sort of arena in which complicated 
patterns can spring up, interact, and evolve "by 
themselves." We cannot hope to specify perceptual 
processes in all their complexity, but must allow 
them to arise in an evolutionary process of ela- 
boration in which what is given consists of cer- 
tain primordial elements, certain laws of trans- 
formation, and the opportunity to carry out these 
transformations. This is in accord with many 
mechanisms which are believed to operate in human 
perception and thought. 

In the previous presentation of 'motivation 
for part of the model, ^ reasons were explained 
for wishing to represent l 1 good" Gestalten by 
normal modes which had mathematical properties 
S imil ar to those of eigenstates and transforma- 
tion amplitudes in quantum mechanics (although 
the physical realization has nothing whatever to 
do with quantum mechanics) . This is because 
these mathematical structures lead to theorems 
specifying behavior depending upon relational 
aspects of complex configurations and resembling 
perceptual phenomena. The first part of this 
paper will add further psychological features to 
the model and will be self-contained with no 
essential reference to quantum mechanics. 

In accord with the program of studying the 
initial stages of perception which provide stable 
perceptual units for discrimination, we assume 
the existence of a device that can recognize per- 
ceptual units which we have transformed into suf- 
ficiently stable and standard form. Thus we 
start with the existing level of technology of 
pattern recognition. What might we add by 
developing the idea of coding as modes of oscil- 
lation of complicated and changeable networks or 
resonators? 

First of all, we have the obvious fact that 
our perceptual units will be very complicated. 
They might be sorted and combined in the same 
ways as conventional symbols, but in addition 
they possess elaborate internal structure which 
may be used as a vehicle for the expression of 
relationships with other symbols and an agent or 
object of transformations induced by interactions 
with other symbols. Next, we have the equally 
obvious fact that complicated modes of oscilla- 
tion "spring up by themselves" in the sense of 
our intuitive requirement, specified perhaps 
only by some frequencies or simple symmetries of 
the impressed energy. These patterns are more 
complicated than any we could build directly, 
and this phenomenon of a complicated pattern 



springing up from a much simpler one might allow 
for richness of perception and thought without 
making unreasonable demands upon the complexity of 
the coding and memory. Evidence for the storage 
of ideas in the form of thought precursors is 
cited in a previous bibliography 1 and reviewed, 
and such a method might provide a logical exten- 
sion of the procedure of Shaw, Newell, Simon, and 
ELlis* in constructing their Information Process- 
ing Language. In this language the data are not 
inert and structureless but are provided in the 
form of data programs, and the data are obtained 
by executing these programs. A list of data may 
be specified by a list of processes that determine 
the data. These authors explain that their ap- 
proach leads to a computer that contains at any 
given moment a large number of parallel active 
programs frozen in the midst of operation and 
waiting until called upon to produce the next 
operation or piece of data. Development of our 
model might provide feasible ways of "freezing" 
extremely complicated programs. 

Two topics which immediately come to mind 
in connection with modes of oscillation are the 
superposition of patterns into complicated struc- 
tures and the resolution of complicated patterns 
into linear combinations of basic oscillations. 
Although there is no more information in a linear 
superposition than in the components, we shall 
deliberately be working with a language in which, 
as befits perception, some information will be in 
the focus of attention while other information 
will temporarily be relegated to the background. 
Otherwise, only chaos would be perceived. We 
shall see later how our symbols will contain the 
latter kind of information in a latent form and 
how this information can be brought into the 
awareness of the recognizing. part of the machine 
by a process which consigns other information to 
latent status. Thus the available information in 
superposition may be different from the available 
information in the components. 

The preceding paragraph suggests that we are 
interested in observing the same pattern from dif- 


ferent points of view in order to bring out dif- 
ferent pieces of information. It seems natural to 
investigate the simplest possibility consistent 
with our concern for the superposition and 
resolution of modes, namely that the various 
points of view are represented by the components 
of the complicated symbol in various coordinate 
systems. That is to say, we resolve our symbol 
into combinations of various kinds of modes. 

What new perceptual features would this introduoe? 
Since the components of the symbol in various 
coordinate representations are related by linear 
transformations, we shall have, once the observa- 
tion procedure is explained, a schema for the 
connection and interdependence of observations 
called for in the first paragraph. This rule of 
connection will not be as rigid as it may seem at 
first because flexibility and variety will be 
introduced by the "latent" information in each 
representation which shows up only in other 
representations, so that from the point of view 
of the recognizing system, one representation 
does not tell all about the others. Second, the 
process of utilization of information will itself 
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introduce probabilities and possibilities for re- 
combination and variation of symbols, and this 
process can lead to the evolution of more complex 
forms. 

In accord with the first paragraph, we see 
that any perceptual element in one coordinate sys- 
tem will contain partial representations of many 
other elements in that system or other systems. 
Moreover, linear transformations between repre- 
sentations do not coordinate particular parts of 
patterns in one representation with particular 
parts in another but are of a holistic nature, in 
accord with the first paragraph and with element- 
ary facts of Gestalt perception. 

Let us introduce more structure by consider- 
ing the mathematical properties of the coordinate 
systems. The natural kind of coordinate system 
is provided by normal modes of oscillation, or 
more precisely, eigenfunctions of lineal operators 
which characterize the system, that is, functions 
which are merely multiplied by a constant when 
the operators are applied to them. Later on we 
shall explain how these normal modes will provide 
precisely defined patterns which we shall identify 
with "good" Gestalten. What new perceptual 
characteristics are thus introduced? First of all, 
we have the general feature that stable perceptual 
units would have to appear in certain discrete, 
reproducible, integrated forms, which may change 
dis continuously from one to another with nothing 
in between. This is such a characteristic fea- 
ture of human perception that we often tend to 
forget those instances in which we do see indefi- 
nite and merging forms. Evidence that stable 
percepts rest upon a basis of merging, streaming, 
scintillating, and reduplicating forms is adduced 
by Schilder.* This is the kind of thing our modeL 
tries to do; that is, to base stable perceptual 
units upon wave-like functions which permit super- 
position, decomposition, and transformation. 

Specifying the operators of our system deter- 
mines these normal modes which will constitute the 
stable perceptual units. Thus the system actively 
"carves out" units, in the terminology of the 
first paragraph, by means of operators built into 
the system or attained through a process of learn- 
ing. When we discuss the process of elicitation 
and utilization of these perceptual elements, we 
shall see the interrelations among clusters of 
elements belonging to the same operators, and we 
shall derive behavior related to the clusters of 
thought precursors formed around individual 
generative principles mentioned in the first para- 
graph. We shall also derive a certain inter- 
changeability in the early stages of perception 
among the modes belonging to a single operator. 

Our last observation at this stage pertains 
to the linearity of the system. Characteristics 
of the system will be represented by operators, 
and perceptual units by combinations of their 
eigenfunctions. As a consequence, we have the 
fact that although the operators may be related 
by complicated nonlinear relations, the percept- 
ual units will be linearly related. This results 
from the general mathematical fact that the eigen- 
functions of any of the operators we shall con- 
sider can be linearly superposed to produce any 
function we shall, require. 
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Our observations so far have been very gen- 
eral. Nonetheless, they are essential both to the 
derivation of the model and to the comprehension 
of the purpose of the model once derived. Our 
method so far has been to assume that a useful 
model can be constructed and to examine what the 
various parts will do. Only when we know this can 
we know how to put them together. Our conclusions 
so far, supplemented by the equally general com- 
ments upon selective awareness and group proper- 
ties of perceptual transformations, will, as 
indicated in Table 1, lead to conclusions with a 
great deal of content. 

Our next stage will be to see what the opera- 
tors and eigenfunctions mean. We shall frequently 
be interested in the results of a number of dif- 
ferent types of observation of the perceptual unit 
forming system by the recognizing system. Let us 
suppose that an observation of some type, which we 
shall label A, might reveal any one of the states 
a l, a 2 , etc., an observation of type B one of the 
states bi, b2, etc., and similarly for observation 
G, and so on. In describing events at a percept- 
ual or mental level of organization, we custom- 
arily use a form of language in which we are able 
to formulate laws governing the influence of past 
events upon the present without considering the 
details of trace structures which exist at all 
intermediate times. We therefore seek a formalism 
which connects behavior at a single earlier obser- 
vation with that at a single later observation, 
rather than providing a continuous description of 
observations. In particular, we shall often be 
interested in the conditional probabilities 
p ( a i) a j)> p ( a il bj), etc. that an observation of A 
will reveal the state &± } given that a previous 
observation has revealed a particular state. In 
situations where an A observation must yield some 
a state, we have the equality VPCa^jbj) =1 
for an arbitrary state bj. ' 

We characteristically do not see all aspects 
of an object simultaneously but typically find 
one aspect at the focus of attention, while other 
aspects remain indefinite until some effort is 
made to perceive them clearly. If the perceptual 
experience depends in some statistical fashion 
upon a set of "observations," i.e., elicitations 
of coded information by the recognizing part of 
the machine, we might expect that observations of 
the first 'type all yield the same result, while 
results for the second type of observation will 
be less coherent. If we interpret our P’s above 
to apply to two observations in this observed 
set, we might then have a situation in which 
P(aj c |a^) * (i.e., 1 if k = i and 0 otherwise), 

while P(bj |ajj might take on various values. 

This statement is intended to cover a number of 
situations. For instance, we may suppose that 
two observations are performed in immediate suc- 
cession, the second being applied to the infor- 
mation elicited as the result of the first. Then 
the requirement states that if an observation of 
type A applied to the original information has 
revealed the information a^, then further obser- 
vations of type A applied to the elicited infor- 
mation aj_ can only continue to reveal a^, while 
an observation of a different -type B applied to 
ai might reveal any of the bj. According to a 


second interpretation, elicited information func- 
tions slightly differently. Suppose that a par- 
ticular perception or action of type A is not 
correlated with a single element ai, but rather 
with a particular distribution of weights among 
all the elements of the set a 2> •*..}. A 
natural way of assigning weights is to perform a 
number of independent A observations upon the 
same information, thus obtaining a population of 
a’s, and to let the weight of a^ be the proportion 
of a^’s in that population. Then the probability 
P(a k |ai) may be interpreted as the probability 
that a randomly chosen member of the population is 
ajj, given that a previously randomly chosen member 
was a^. The condition P^lai) =$ ^i then states 
that all the members of the population are the 
same, so that the percept or act has only one 
nonvanishing component, and thus the condition 
allows for the existence of such "elementary" 
acts. A weaker requirement might allow for the 
existence of several, but not all components. 

This requirement, general as it may be in isola- 
tion, will in connection with our other general 
requirements shortly turn out to impose very 
specific requirements on the mathematical struc- 
ture of the model. 

The next stage in the development of the 
model is to try to combine this statistical for- 
malism with the system of linear transformations 
which we have previously discussed. In order to 
simulate important features of human perception, 
we wish to find ways of relating the conditional 
probabilities between pairs of sets so that 
transformations may be compounded, or so that they 
may be decomposed into transformations involving 
intermediate sets, as for example by expressing 
P(c|a) in terms of P(e|b) and P(b\a). These 
transformations must preserve the sums of proba- 
bilities, so that they add up to one as before. 

Our first thought is to let the previously dis- 
cussed linear transformation scheme be identical 
with the well-known matrix multiplication of 
probabilities, P(c|a) = £ P(c|bj)P(bj(a) . 

i 

However, if we now refer back to the discussion of 
selective awareness in the preceding paragraph, 
we see that we shall in general be unable to ob- 
tain the situation there described, in which 
P(ajj.|aj_) » 0 if k / i. The reason is that if the 
same matrix combination is to hold for all kinds 
of observations, we find by replacing c by a^ and 
a by aj_ in the multiplication rule that PCakJai) 
will be the sum of several positive terms, and 
this contradicts the requirement that it equal 
zero. That is to say, we can get from a^ to 
via some of the b's, contrary to our desire, 
unless we can define a transformation according to 
which many of these paths cancel each other. 

Land^,°» in works which gave me ny first 
notions of what to do about ny conviction that 
something useful might come of analogies between 
quantum mechanics and perception, points out that 
there is just one known system of transformations 
which will do. If we restrict ourselves to the 
case of symmetrical probabilities, P(b|a) = 

P(a|b), (a restriction for which certain heuristic 
justifications may be offered), then the transfor- 
mations may be defined by complex valued matrices 



I'M = IIY(a k (b 3 )l| , such that the squared 
modulus t'S'kjl* of the element equals the cor- 
responding probability P(ak|bj) . Then it can im- 
mediately be seen that the requirements that the 
probabilities add up to one and that they satisfy 
the condition for selective awareness are equiva- 
lent to the condition that the matrices just de- 
fined be unitary matrices (the’ complex analogue 
of orthogonal matrices). We shall call the ^ 1 s 
by the name which analogous quantities have in 
physics, probability amplitudes, to distinguish 
them from the probabilities P. 

Of course, we are not denying the validity of 
the well-known law of matrix multiplication of 
probabilities in favor of the matrix multiplica- 
tion of probability amplitudes. The two rules 
apply to different situations, the former deter- 
mining, as it must, the probability that c is 
elicited given that b and a have been elicited, 
and the latter determining the probability that c 
is elicited if only a has been elicited, express- 
ing this probability in terms of two conditional 
probabilities that refer to observations not 
actually performed. This difference of course 
requires that we introduce a suitable precisely 
defined way in which the elicitation of b modifies 
the information. 

Now we may connect this discussion of proba- 
bility amplitudes with the previous groundwork 
concerning transformations between coordinate 
systems by remembering that the unitary matrices 
are precisely the matrices which transform between 
coordinate systems defined by sets of orthogonal 
and normalized vectors. And now we may turn the 
argument around and say: given a coding of infor- 
mation as complex oscillations, with components in 
various coordinate systems, we wish to define a 
process which will elicit the information corres- 
ponding to a particular component with a probabili- 
ty proportional to the squared modulus of the 
amplitude of that component. As emphasized in 
Table 1, we are led to this result only by combin- 
ing the above desiderata at the behavioral level, 
and with this result we begin to collect the 
promised fruit of all the discussion that has gone 
before. Once we define a suitable mechanism for 
elicitation of information, we shall be able to 
use the discussions of qualitative properties of 
mathematical structures to show how the model will 
simulate certain important structural character- 
istics of thought. Then, when we add a suitable 
mechanism for the change of the ^ in time (or 
with respect to some other parameter), we shall be 
able to derive some typical properties of Gestalt 
perception. 

We shall shortly define a mechanism for the 
elicitation of information with a probability 
equal to the squared modulus of its amplitude. 
Accepting for the moment the assertion that this 
can be done, we may immediately derive operator 
and eigenfunction equations and thereby specify 
the eigenfunctions which will be the coordinate 
vectors. We may expect the observable properties 
of the oscillatory states to depend upon numerical 
functions which take different values in different 
states, because the input of the recognizing part 
of the machine can always be expressed in such a 
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form. These values will be precisely defined 
without any statistical spread (we shall say 
"sharp") in the set of states for which some 
perceptual aspect is definite. In such a set of 
observations of type A, let us denote the pre- 
cisely defined numerical value in state by 
the same symbol a^. Then for the mean value of 
a over all states corresponding to observation A, 
we clearly have a = £akP(akJs) = 2 s) 

it «* 

a k^( a kl s ) , ^ definition of the relation between 
¥ and P, where s denotes the state of the system 
in an arbitrary coordinate system. What will be 
the mean value of a in terms of the B coordinate 
system? If in the expression for a we perform 
the linear transformation to the B representation, 
we find that a s) , where A 

is the linear operator represented in the B sys- 
tem by the matrix having elements A^ = 

(aj c |b m )aj c ''J r (aj c |b n ) . According to this formalism, 
the operator A is represented in the A system by 
the diagonal matrix Ayyi = $ ^ t . This standard 
kind of computation shows the origin of the 
operators mentioned in our discussion of stable 
configurations . 

Now all one has to do to obtain eigenfunc- 
tion equations defining the coordinate systems is 
to multiply both sides of the defining equation 
for Ajjja by (b n J a-j_) and sum over n, making use 
of the unitary nature of the ¥ ‘ s. The result is 
the eigenfunction equation AY( b ml a i) = a i * 

(b m ^ a i) . Given the matrix representing A > one 
may solve this equation to obtain the various aj 
and corresponding • This equation 

states that the transformation amplitude matrix 
from any set of states to the set in which a 
particular type of observation is sharp must be 
an eigenfunction of the operator corresponding to 
that observation in the former set of states, and 
the numerical value a^ identifying the state must 
be the associated eigenvalue. We began by merely 
requiring a^ to be some number associated with 
the state. The present conclusion follows en- 
tirely from the requirement that probabilities 
be the squared magnitudes of the ¥ 1 s, and we 
were led to this requirement by considering gen- 
eral behavioral desiderata. 

Examination of the mathematical meaning of 
the Y's reveals that the eigenfunction "'{*’( b m | a^) 
is simply the set of coordinate values repre- 
senting an a vector in the B system, so that we 
may alternatively consider basic coordinate vec- 
tors to be eigenvectors of operators, or the 
transformation amplitudes to be eigenfunctions of 
operators. It is easy to showb>V,o y^at our 
assumptions have implied that the operators must 
be Hermitian, and their eigenvectors will thus be 
orthogonal and provide suitable coordinate sys- 
tems. 

In a previous paper^ I discussed a mechanism 
for the successive transformations’ of a set of 
inputs to a network, which produced a group of 
transformations depending upon a parameter T. 

This parameter was the analogue of time in quan- 
tum mechanics, although in the case of the net- 
work it denoted spatial distance across a network 
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from an input mosaic to an output mosaic. It was 
briefly noted that if we had a network containing 
a large number of alternate paths for the signals, 
each path being specified by distributed phase 
lags, we would obtain transformations of coordi- 
nate systems described by equations analogous to 
the Schrodinger equation, namely zy/dv= 4.H¥, 
where H is a Hermitian operator defined in terms 
of the phase lags. Such a formalism provides one 
interpretation for the earlier statement that nor- 
mal modes will be stable, for the dependence of 
eigenfunctions of N will be entirely exponential 
with an imaginary exponent and will cancel out of 
the observation probabilities when we take the 
squared magnitudes in the elicitation process. 

We must ultimately be more specific about the de- 
tailed nature of H and other operators which 
determine the behavior of the system, because we 
will not be satisfied merely with quasi-percepfcual 
behavior in unspecified situations. However, we 
shall be able to draw a large number of conclusions 
which follow from the formalism alone and which 
will thus be true independently of the precise 
nature of the operators. These sometimes enable 
us to work backwards from behavioral desiderata 
and thereby eliminate whole classes of possible 
operators. However, since this paper will be con- 
fined to conclusions which follow from the gen- 
eral formalism alone, we shall not discuss the 
problems of further specification of operators. 

We shall explicitly refer to the transformation 
equation of this paragraph only in connection 
with certain so-called "Gestalt constancies." 

The last mechanism we shall have to intro- 
duce is the one which selects normal modes with 
a probability proportional to their amplitudes. 

The problem is as follows: Suppose the oscilla- 
tory pattern representing, for instance, all possi- 
ble ways of looking at a complicated stimulus is 
a function ^ of a number of variables, which we 
shall not specify in detail. For Illustrative 
purposes we may consider the excitation at some 
point to be a function of time, V* (t) . We have 
supposed all along that ^ may be resolved into 
superpositions of various kinds of normal modes, 
or eigenfunctions. Thus, alternatively, ^ (t) = 

XI Sn^n^) or = 2t> n ^n(t), etc., depend- 

ing upon the point of view from which the total 
configuration is being observed. Suppose the 
point of view is such that 'f'' is split into the 
0 n . The coefficients a n will be complex numbers. 
We want to specify a mechanism which will pick 
out one of the normal modes and discard the rest, 
in such a way that the probability of picking the 
kth mode <p k is equal to l a kl* ', or actually to 
i%i 2 /?ki 2 in case V is not normalized. How 
does one pick something with a probability propor- 
tional to the square of a complex number? 

Wiener? >10 has considered this question in connec- 
tion with the identical situation in quantum 
mechanics, and his conclusion is that he can think 
of only one non-quantum mechanical process which 
could do that. Therefore I decided to examine 
the consequences of incorporating such a process 
in the present model, and it was at this point 
that the model turned out automatically to pos- 
sess many interesting structural properties of 


thought, together with the ability to learn by 
being "rewarded" in a simple way. 

This selection process is quite simple and 
will now be described. Wiener gives two variants, 
of which only one will be considered here. Let 
Y(t) be the integrated complex-valued output of a 
shot noise generator. Then dY, if it existed, 
would be the instantaneous output. Actually, this 
is a physically unrealizable idealization of a 
Gaussian random process. What one actually meas- 
ures is the increment AY for a small increment of 
time. But the intuitive notation can be given a 
precise sense. It turns out that the integrals 

J 4>n (t)dY(t) are all random variables with 
Gaussian distributions, and in case the <£> n are 
all normalized the distributions will all be the 
same. Moreover, if the are orthogonal to 
each other, that is, St £(t)^ m(t)dt = 0 for 
n ?= m, then the Gaussian distributions will be 
mutually independent. Our general considerations 
led us to consider Hermitian operators, which have 
orthogonal eigenfunctions, so that this condition 
is satisfied. How let us calculate the time 
averages of the normal components of V' multiplied 
by the shot noise: A n = / a r ^ n (t)dY(t) . This is 
accomplished by a physical device which finds the 
statistical correlation of two inputs. Finally, 
let these numbers A n control a gate which lets 
through the mode having the largest A n . Since 
the A 1 s are random variables, this process picks 
the modes statistically, and the probability of 
picking the kth mode turns out, as shown by 
Wiener, to be just what we want, namely 
\ a k\ 2 / £ la n l We shall not go into detailed 
consideration of the suitability and implications 
of different ways of comparing the A’s,l0 except 
to say that if comparisons are made in pairs, the 
order in which the pairs are picked makes little 
difference, and that it is also possible to com- 
pare pairs of superpositions of several modes 
and then split these up further . This latter 
point will have a useful interpretation. A pos- 
sible way of performing the process is indicated 
in Figure 1. This general type of process turns 
out to have numerous interesting consequences, 
which we are finally ready to understand, all the 
groundwork having been prepared. 

Restrictions on the length of this paper pre- 
clude all but the briefest discussion of these 
consequences. Further information, especially in 
connection with an attempt to understand the 
representation of information in the brain, will 
be published elsewhere. However, making use of 
the background of information which we have 
developed about the properties of the mathematical 
structures, we may derive most of the interesting 
features in a few sentences. Here is a list of 
them. 

We begin with the ways in which information 
coded as modes of oscillation is modified in the 
process of elicitation by the recognizing part of 
the machine. The information exists in a popula- 
tion of "wave functions" , which are split up 
in various ways into superpositions of components 
<f> n or n, etc. Tbe process of elicitation 
operates upon one of these resolutions and selects 



one or more components, ignoring the rest. The se- 
lected components will then be identified or pro- 
duce some action or be subjected to further trans- 
formations, and they may or may not subsequently 
be returned to the population of wave functions, 
depending upon the purposes to be served. Observ- 
ing the same information from a different point of 
view is interpreted as splitting the wave function 
into a different set of components and then select- 
ing some of these components. The various compo- 
nents in any particular resolution correspond to 
various ways the pattern could appear when looked 
at from the corresponding point of view, or to 
various activities in a particular mode of behav- 
ior. 

If there is just one Y (instead of a popula- 
tion of them), then an observation from one point 
of view might elicit any one of the 0's, but sub- 
sequent observations from the same point of view 
will continue to elicit the same component. This 
is one way in which an ambiguous "percept" becomes 
clarified in the process of bringing it to the 
"awareness" of the recognizer. Another, and pos- 
sibly more significant, way will be explained 
when we come to the effect of the shot noise on 
the latent information. 

If the information selected by the elicita- 
tion process is returned to the population of 
wave functions, an interesting phenomenon results. 
There will be a repeated process of splitting the 
wave function into components of some type, dis- 
carding some of the components, returning the 
remaining randomly selected components to the 
population, splitting them into other types of 
components, and so on. This process brings about 
the reappearance of components which have pre- 
viously been discarded. For instance, if only the 
<f> 1 component is retained, and it is split into a 
superposition of several ^ components, and only 
one of the ^ 1 s is retained, this | will in gen- 
eral when resolved in the 0 system contain many 
of the 0's, and thus contain 0 1 s which had pre- 
viously been discarded. This is a perfectly ele- 
mentary fact in any vector space, which has two 
immediate consequences for us. The first is that 
if an input excites one mode, then the result of 
the resolution and selection process is to excite 
other modes in the same coordinate system. In 
other words, the model exhibits a form of associa- 
tion . It is a standard problem in probabilities 
to calculate the association strength as a func- 
tion of such things as the number of iterations 
of the resolving-selecting process, and the prob- 
lem is being investigated. The second consequence 
will be explained in connection with learning. 

The probability of finding a particular value 
for the numerical function which identifies the 
possible results of some type of observation is 
determined by the squared magnitude of the corres- 
ponding amplitude and is thus independent of any 
factor of the form ej“. This means that an arbi- 
trarily large amount of information may be included 
in the phases of complex exponential factors of 
the amplitudes of modes belonging to some type of 
observation without making any difference in the 
results of that observation. However, this infor- 
mation will affect the results of a different kind 


of observation. For example, suppose that Y - 
+ a 2 e^2^, and that <h - '4 f (i|i)^i + 
'F(2|i)| 2 « Then in an observation which reveals 
0 *s, the probability of finding 0^ and its 
associated numerical value will be l a ll? indepen- 
dent of and oC 2 . However, in the ^ system 

S' = [y(l|l)a 1 e3« , l . V(ll2)a 2 e' i0t 2]^ 1 + 
nrUllKe^l + >l'(2|2)a 2 e5*2]^ 2 , so the 
probability of observing ^ -j. will be |¥(lll)l 2 * 

| a;L l 2 + l'¥(l|2)J 2 |a 2 | 2 ^^^(l|l)'F(l|2)aJ a 2 • 

e j ( V*L) * ^(l|l)^(l|2)a 1 a|e^ ctl - <fe ), which 
depends veiy much upon ol-j. and 0C 2 . This is what 
was meant by the statement that in any representa- 
tion there would be latent information which did 
not show up in that representation but only in 
others. We first introduced the idea of select- 
ive awareness by requiring that the states in the 
focus of attention be clearly distinguishable, 
while the others are uncertain. The present con- 
sequence shows that each point of view will 
reveal information which will not be revealed by 
other types of observation. We may look at this 
fact in two ways. One way is to say that a per- 
ceiver must at any time disregard many things in 
the stimulus pattern in order to have any intelli- 
gible perception. The converse way is to say 
that knowledge of one aspect or even several will 
not include all the information needed to under- 
stand other aspects and their interrelations. 

The additional information required is the rule 
for the ordering and interconnection of impres- 
sions-*- ’mentioned in the first paragraph. This 
and several other interpretations of the meaning 
of the mathematical formalism are well known in 
the analogous situations in quantum mechanics, 
and. credit has previously been given^ to the 
sources of some of these formulations. 

The next three consequences concern Gestalt 
perception. If a clear perception or an inte- 
grated action depends upon the coherent behavior 
elicited from a population of wave functions , 
then we might have a situation in which the 
clarity depends upon all the V/‘s being the same. 
This is called a pure case of an assembly in 
quantum mechanics^ otherwise the assembly is a 
mixture . In connection with the preceding con- 
sequence, we may note that an observation from 
one point of view wi ll turn a pure case into a 
misture with the same response probabilities in 
that point of view, but will disrupt the coher- 
ence of responses from other points of view. It 
is a simple mathematical fact that a composite 
system may be pure while its subsystems are not. 
Moreover, an observation of one subsystem may 
produce a change in the pureness of another sub- 
system. This fact has been discussed^ in rela- 
tion to the well-known facts of Gestalt percep- 
tion that (a) a stable perceptual unit, or "good 
Gestalt," may become unstable and hard to see if 
embedded in a larger pattern, and (b) what was 
originally a single Gestalt may be broken up in 
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various ways by the addition of new lines in the 
visual stimulus pattern, so that the new Gestalten 
will not coincide completely with any of the old 
ones. Illustrations were given in that paper. 

This is of great importance in a perceptual 
machine, because we might wish to scan a pattern 
to find a meaningful pattern hidden it it, while 
at the same time we do not wish every chance 
grouping of lines in a stimulus to be perceived 
as a meaningful pattern. 

The next Gestalt property rests upon the 
previously considered assumption that the trans- 
formation given by an equation like jH’K 

the solution of which is some function 

a^n* In lhls case ‘ the following situation 
is well known in quantum mechanics. Suppose that 
conditions are changed in such a way that H 
becomes a function of a parameter . Then the 
solution of the equation will likewise become a 
function of /* which we may write (/*) « a^ 

(H*n> where a^O) = a^. The new response 
probabilities will be l a n(f fc )l Now one might 
expect the change in to be roughly of the same 
order as the change in f< . It turns out instead 
that the change in a^ is of the same order as the 
rate of change of/* , that is, df/dv . Thus, 
provided that /* changes very gradually, it can 
become quite large without inducing a significant 
change in the response probabilities. Examples 
were given3 of very important analogous phenomena 
in perception, the Gestalt constancies. In order 
to perceive objects as meaningful units, one must 
ignore certain inhomogeneities, and it is well 
known, for example, 'that large spatial variations 
of luminance are not perceived, provided that 
they are gradual; but they become prominent when 
any discontinuity is introduced. An illustration 
has been explained in detail. 3 

The third Gestalt property likewise requires 
a wave equation. There is some reason to believe 
that while the constancy phenomena would occur 
with a wide class of wave equations, these equa- 
tions must be first order in T, as written above. 
In this case the time dependence of a wave func- 
tion which is an eigenfunction of Hwill be 
exponential with an imaginary exponent, and thus 
w ill cancel out of the response probability. 
Probabilities arising from a superposition of 
such steady state solutions will, however, be 
time dependent, so that one source of change is 
derived from the interference of such steady 
state solutions for different states associated 
with H. It is possible to make the interpreta- 
tion that certain perceptual changes, inexplic- 
able when only one point of view is considered, 
come from the information, not available to 
awareness in that point of view, about the range 
of possibilities of other, aspects. Moreover, one 
can calculate the frequencies at which the per- 
ceptual transitions will occur. Because of the 
determinate form and possible discreteness of the 
normal modes, a change might be as abrupt as in 
the perception of the well-known picture of a 
staircase which can look as though it is being 
viewed either from above or from below. In gen- 
eral, the resolutions of V into * s and into 
5 » s given in connection with latent information 


may be used to predict a possible interference 
effect of a second stimulus introduced into the 
field. The situation is analogous to the inter- 
ference effects observed when an electron has Idle 
opportunity to go through two holes. 

Next we shall consider a general feature of 
the model which resembles the mode of organiza- 
tion of certain highly integrated action patterns 
in animal behavior. Reference to Figure 1 reveals 
that the information is elicited by a process 
which has two stages: first, splitting the wave 
function into normal modes, and second, selecting 
a mode by a mechanism utilizing shot noise. The 
first stage requires either very specific filters 
which separate the modes, or else (as shown) 
precise copies of the modes which can be used to 
obtain the various components in the same way one 
finds terms in a Fourier expansion. The second 
stage requires that the noise level be high 
enough so that the results of the integrations 
may exceed any thresholds which exist in the 
component which coup ares them. There is evidence 
from the study of instinctive behavior in ani- 
mals (previously cited 1 and reviewed^) that com- 
plicated action patterns are broken down into a 
sequence of acts, each one released under the two 
conditions that a very specific stimulus be 
present and that a drive level be high enough. 
People are trying to build machines that do 
remarkable things, and it seems wise to see how 
those things actually are accomplished in nature. 
Even if the detailed mechanisms of instinctive 
behavior are not the same as those of the present 
model, it may be of interest that they can be 
described by the same flow charts at sqme level 
of abstraction. 

Next we shall examine some ways in which the 
behavior of the model resembles structural 
characteristics of thinking. We have mentioned 
the importance of the fact that the wave function 
is split into many modes and some are selected 
randomly. The random selection, aided by sys- 
tematic biases, of which we shall later give one 
example in connection with learning, can lead to 
the evolution of complex forms. It seems to be 
typical of thought and action processes that many 
alternatives are proposed out of which some are 
selected. In perception we have, for instance, 
the previously mentioned evidence that stable per- 
cepts emerge from a conglomeration of wavelike 
images. In instinctive behavior there is evidence 
that the animal behaves in a searching or random 
way until the two previously mentioned conditions 
release the block to action and allow selection 
of a particular action pattern. In thought 
pathology there is some evidence that a large 
variety of symbolically related ideas can emerge 
as the final thought product, which would be sup- 
pressed precursors of normal thought processes. 

In the development of thought in an infant, 
according to hypotheses of Freud, various rudimen- 
tary images become associated with drives. In 
the absence of the tiling which will satisfy the 
drive and reduce its level, these images become 
activated in some way which raises them to 
hallucinatory vividness (like the images in 
dreams) . If the activation level is below 
threshold, the necessary concentration may be 
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achieved by utilizing energy from another image, 
or by fusing several images related to the same 
drive so as to pool their activation energy. Any 
such image may stand for the drive in this form of 
thought, termed the primary process by Freud. 

Ideas may stand for their opposites, because at 
this stage there is no mechanism to distinguish 
between evoking an image to affirm it or to deny 
it. Thus ideas are associated by their relevance 
to the same drive, not because of formal logical 
relationships. Moreover, mutually contradictory 
ideas can exist side by side, for there is no 
mechanism for comparing them and rejecting one or 
both. We can find familiar examples of such 
thinking in the substitutions and fusions of 
images in dreams and in slips of the tongue, but 
it is the aim of the arguments- 1 - cited in the first 
paragraph of the present paper to make clear the 
fact that any organized and independent thought 
processes, however logical their outcome, must 
employ such a process. While such thinking is 
indispensable, it must be supplemented by a 
secondary process which enables the infant to get 
along in the world. Under the influence of 
learning by checking his ideas with the real en- 
vironment, he must voluntarily delay the drive 
induced discharge of images so they are in accord 
with reality, and he must make connections, not 
between ideas which are related merely to the same 
wish, out between ideas which are related by 
virtue of having the same relation to actual ex- 
perience. The absence of this unlimited fusion of 
ideas results in less pooling of the activation 
energy, and the representations become less vivid, 
taking the form of thoughts instead of images of a 
hall ucinatory nature. These thoughts are now mani- 
pulated logically so that they may lead to actions 
producing gratifying changes in the environment 
rather than merely the hallucinations of grati- 
fication. In logic itself it appears, ^ that not 
only on the lowest levels of the synthesis of mean- 
ing but at the highest propositional levels, too, 
reasonable statements may be produced only by a 
process of rejection of a multitude of lawfully 
produced but unreasonable statements. 

If these hypotheses are true, thought, while 
remarkable, does not appear inconceivable, and it 
seems worthwhile to see whether such processes may 
be built into machines. An example of a machine 
program which has some features of this generation 
of clusters of ideas around individual goals fol- 
lowed by selection of some of them by means of an 
evaluative process is described by Newell, Shaw, 
and Simon- 1 - 1 - in a discussion of chess playing 
machines. Their program contains a subprogram 
built around a set of about a dozen goals, each 
corresponding to some feature of the chess situa- 
tion. Each goal has associated with it a move 
generator and an analysis and evaluation procedure. 
The move generator associated with a goal proposes 
alternative moves associated with that goal, re- 
gardless of suitability, on the basis of any con- 
nection with that goal. The evaluation and 
analysis procedures determine the value of the 
move from the point of view of that goal alone. 

The analysis procedure is concerned only with the 
acceptability of a move once it has been generated 
by the move generator. The consequences of the 


moves generated by all the goals are explored by 
evaluating possible game continuations generated 
by a different generator from the one which pro- 
poses moves. An executive routine makes the 
final choice of an acceptable move from the fifty 
or so proposed moves. 

The present approach is to try to get things 
like this to happen by themselves without having 
to specify precise programs in advance, in order 
to allow the spontaneous elaboration of symbolic 
complexity. Let us examine the way information 
is elicited in the present model. There are two 
stages in this process. In the first stage the 
wave function If/ , from which all points of view 
may potentially be extracted, and which may even 
represent a large number of ideas in some inte- 
grated portion of the machine's "mind," is split 
into components in one or another coordinate sys- 
tem. Two possible resolutions of the wave func- 
tion are represented by the hexagons and squares 
in Figure 13. As we have proved mathematically, 
each of the components contains some latent 
information, represented by the shading, which 
is not available to the "awareness" of the 
recognizing part of the machine, but which be- 
comes transformed in other resolutions into infor- 
mation which can be elicited. This latent infor- 
mation is thus necessary to the reversible trans- 
formations between different points of view. At 
this stage of merging and splitting all informa- 
tion is present only in the form of potentiali- 
ties. Suppose that V' is resolved into tf> com- 
ponents and the shot noise generator is turned 
on. One of the 0 's will then be elicited, pro- 
vided that the noise level is high enough so that 
the threshold of the comparing component is ex- 
ceeded. The expansion coefficients give the 
probabilities for elicitation of the various 4> 's, 
but there is no way of knowing precisely which 
one will be elicited in a given case. Thus the 
noise level acts in a way similar to the drive 
level which elicits images. Now suppose that the 
noise begins at a very low level, too low for the 
elicitation of one of the 0 's. The threshold 
may be exceeded, however, by the result of the 
combination of the noise with a superposition of 
two or more of the 0 's, since this result will 
be + aj0j + ...,)dY instead of merely 

i a k$ k^> and the corresponding elicitation 
probability will be proportional to la^ ( 2 + )aql^ 

+ ... instead of merely i%|^. Thus superposi- 
tions of images will be elicited, in analogy to 
the condensations of dream images. In general, 
at this stage there is no way to refer to a mode 
except by exciting it, and the association be- 
tween modes comes from belonging to the same 
resolution of the wave function, i.e., in our 
interpretation, belonging to the same point of 
view. 

Now let us examine the second stage, that is, 
the transformations which the symbols may undergo 
after having been elicited by the random process. 
Let us suppose that the elicited output does not 
come directly from the resolving apparatus 
(dotted line in Figure 1A) but comes instead via 
the multiplication component which receives its 
input from the resolving apparatus (solid line). 
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Let A Y(t) be the instantaneous output of the 
noise generator, which would have no well-defined 
meaning if the noise were ideal shot noise, but 
in practice will be Y(t + At) - Y(t) over some 
short interval of time At. Then A Y(t) will be 
a random variable with a Gaussian distribution 
which is independent of the distribution of 
AY(t') if It - t *1 > At. Tinder these assump- 
tions, the output will no longer be a k ^> k (t) , but 
will be a^j^t) AY(t) . Let us suppose that 0 i 
and <p 2 have been elicited with their respective 
probabilities, so that we now have a new popula- 
tion of wave functions 0l(t) AY(t - ti) and 
0 2 (t) AY(t - t 2 ). The AY's will be at different 
times because the modes in the population are 
separated out at different times. The two modes 
will be present in the relative proportions ( aj 2 : 
|a 2 |2. This population, as we shall see in a 
moment, may be described by the wave function 
a l4 > l( t )^ Y ( t " ^l) + a 2 0 2 (t)AY(t - t 2 ), which 
gives the right probabilities of elicitation, pro- 
vided that the comparing element averages a popu- 
lation of inputs, because the average of UY(t)l 2 
is always the same constant for all t. Now let us 
return to our previous discussion of the meaning 
of the latent information, which represents 
potentialities in other modes of observation. In 
the expressions for ^ in the 0 and systems, 
let us replace the exponentials e^*! and e***2 by 
AY(t 4- t]_) and AY(t +• t 2 ), respectively. 
Averaging over the population and making use of 
the independence of AY(t + ti) and AY(t + t 2 ), 
we find that only the first two terms of the ex- 
pression in the ^ system remain, the interference 
terms having disappeared. The elicitation proba- 
bility for will thus be wnuri -► 
WH2)f|a 2 | 2, which is the same probability we 
would have predicted by considering the actual 
population of wave functions 0 q Ay(t + tl) and 
0 2 AY(t + t 2 ) in the relative proportion lajhja^ 
rather than the combined wave' function 4>i AY, + 

0 2 AY, This justifies our use of the combined 
wave function. However, before the wave function 
has interacted with the eliciting apparatus, 01 + 
(ft 2 "will not give the result as a population of 
0]_' s and 0 2 's because of the interaction terms. 

It may also be seen that the probability 
which has just been given for is precisely 
that which would have been obtained by using the 
matrix combination of probabilities instead of the 
matrix multiplication of probability amplitudes. 
The preceding argument has thus fulfilled the 
promise made in the beginning of this paper that 
an elicitation mechanism would be designed which 
would make the group properties of transformations 
between coordinate systems consistent with the 
necessarily valid rule of composition of response 
probabilities. This argument is, of course, fa- 
miliar in quantum mechanics, the only difference 
being that in quantum mechanics the random multi- 
pliers are slightly different (<# is a random 
variable in the factor e^** ) , and that we are 
using an ordinary mechanical device to do the 
multiplication. ¥hat is new about the argument 
is that the resulting model is being proposed for 
a concrete device which has many features of 
thinking, so we shall now examine the behavioral 
implications of the above formal mathematical 
property. 


Prior to the interaction with the eliciting 
device, the modes of any resolution contain all 
the information needed for their recombination 
and subsequent resolution in another coordinate 
system. The original wave function could have 
been subjected to any kind of observation and thus 
could have exhibited any one of a number of prop- 
erties. These properties may be mutually incom- 
patible, so that no individual elicited wave func- 
tion could exhibit more than one, although they 
might all be found in a population of responses. 

An individual wave function elicited by one of 
the kinds of observation may no longer contain the 
information which tells about the other kinds of 
observation, so that awareness of one aspect may 
preclude simultaneous awareness of other aspects. 
But all these mutually incompatible properties 
may simultaneously exist in the form of incom- 
pletely developed potentialities in the original 
wave function. This is another respect in which 
the first stage of the present model resembles 
primary process thinking. 

We may look at this another way. The origi- 
nal wave function was potentially describable by 
a wide variety of predicates applicable to the 
ideas represented by the elicited modes. But 
interaction with the eliciting apparatus has re- 
vealed some potentialities and at the same time 
has in a precise mathematical way destroyed the 
information needed to put the modes together again 
and resolve them in a new way which can reveal 
other potentialities. Thus the wave function 
after such interaction can reveal a smaller set of 
potentialities. It may not yet have been recog- 
nized by the recognizing part of the machine, so 
that from the standpoint of the "awareness" of the 
machine it may still be unknown, but it will now 
be an unknown 0 or else an unknown ^ , rather 
than an unknown something which could reveal 0 
aspects or £ aspects, or any number of other 
potential aspects. Even if the properties coded 
as 0 i> 0 2 5 etc. exhaust all logical possibilities 
with respect to that aspect of the total pattern, 
so that in one sense saying that it is a <f>-^ or 
< J> 2 , etc. is a tautology, this information none- 
theless has important behavioral consequences for 
the machine even before it is recognized by the 
machine. We saw mathematically that this w ill 
make a difference in any subsequent elicitation 
of information from another point of view. In 
addition, knowing which aspect is relevant has 
important behavioral consequences because it en- 
tails using the appropriate rule of ordering and 
interdependence-*-* 2 mentioned in the first para- 
graph of this paper. This is a necessary stage 
of concept formation which precedes abstraction, 
and it is important because it tells the 
machine, in a sense, along what dimension to 
think. We may summarize this paragraph by saying 
that the elicitation process, even before its 
results are "noticed, " has partially crystallized 
the information by limiting the set of potential 
predicates. If, in our example, 0 2 eac * 1 

represented a superposition of many modes, this 
crystallization would have been only partial, for 
while the <f> and 0 2 have been multiplied by 
independent AY's so that the interaction terms 
between them have been destroyed, all the modes 



within 0 ^ or 02 have been multiplied by the 
same AY, which makes no difference wit hin th e 
subspace of their superpositions (since l/YYl^ = 
constant) . Thus within these subspaces poten- 
tialities exist which may be destroyed by further 
specification. All the above corresponds both to 
theoretical interpretations of underlying thought 
mechanisms and to common sense observations about 
gradual articulation of concepts which are not 
clear to begin with. 

Our final observation about the second stage 
of information elicitation concerns logic. In 
distinction to the mode of representation of in- 
formation in the first stage, which permits the 
same wave function to contain mutually incom- 
patible potentialities, and which permits merging 
and resplitting, the information is now in a form 
in which only one of the potentialities has been 
realized. If the information in this form is 
utilized by the recognizing part of the machine, 
it will be displayed in a form which follows the 
usual rules of probability, as we have seen, and 
which may be manipulated by the usual logical 
procedures employed in computers. In fact, it is 
well known in quantum mechanics that the mathe- 
matical formalism used to describe the analogous 
features of observation — namely the projections 
of wave functions on particular coordinate axes — 
can be partially described in terms of the Boolean 
algebra of propositions.-^ Thus we may summarize 
this discussion by saying that information is 
changed in the process of elicitation into a form 
more amenable to logic, but at the expense of 
richness of interconnections. Thus the machine 
may use this modified form to explore the logical 
consequences of some concept, but in order to do 
something more creative it may have to return to 
the original form and start again. 

So far we have seen how the model behaves in 
ways resembling some aspects of perception and 
thought. Now I shall very briefly indicate one 
of the ways in which it can learn. We shall as- 
sume that in some part of the machine information 
is elicited from some pool of wave functions, 
used for some purpose, and then returned to the 
pool of wave functions. This process is shown in 
Figure 2, which in addition summarizes some of the 
preceding discussion. Suppose that the contents 
of the pool consists of N copies of the wave 
function 0 , and that at the moment they are 
split by the 0 filters, so that V = £ a i0i> 

i 

where the 0 ! s are symbols for certain responses. 
Suppose that the shot noise generator is operat- 
ing so that one of the (f) 's will be elicited, and 
let us suppose that we are interested in teaching 
the machine to perform one of the responses, say 
<p j_, rather than any of the other <f) ' s. For the 
first elicitation the probability of obtaining 
4>± is |a]_l . In case this occurs, the operator 
(or some other part of the machine) "rewards" the 
machine by turning off the noise generator, 
heanwhile , the 0 q has returned to the pool, 
which now contains N - 1 of the (p's and one 0^. • 
The probability of eliciting <p ]_ has now become 
l a il 2 (N - l)/N + l/N, -which is larger than before. 
The wave functions will be recombined and re- 
solved again in a number of ways, but since the 
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noise generator has been turned off, no informa- 
tion will travel around the cycle, so that no 
information will be discarded, and there will be 
no further change in the probability of finding 
01. Now suppose instead that the first trial 
resulted in the response represented by one of 
the other 0’s, say 02* This time the operator 
leaves the noise generator on for a while. The 
02 returns to the pool, so that now there are 
N - 1 of the p's and one p 2 * These wave func- 
tions become resolved in other coordinate systems, 
just as in the preceding case, but this time, 
since the noise generator is operating, informa- 
tion will go around the cycle and some of the 
components will be lost. Now let us split the 
resulting wave function into <p components for the 
next trial, and let us see what the new response 
probabilities will be. Referring back to the dis- 
cussion of association, we observe that in the 
process of splitting and recombination the dis- 
carded component Pi will have returned to some 
extent. Thus the probability for 0 2 on the next 
trial will have increased somewhat, but not as 
much as that of 0 1 , had the latter occurred on 
the first trial. The probability of any one of 
the 0’s, say 0ji, will become (l/N)^l -t (N - !)• 

Nhi ■ w I 2 Iyum z . . . WHJil z ki 2 }, 

in which the subscripts n, m, ...k stand for the 
various states in the intermediate coordinate sys- 
tems. The probabilities are not so easy to com- 
pute on subsequent trials and depend upon details 
of how wave functions in the pool become acces- 
sible to elicitation, so that further discussion 
will be postponed until a time when it can be 
done in a thorough and systematic way. However, 
it appears that some reinforcement learning is 
possible, reinforcement acting directly only upon 
the noise generator and not directly upon the 
wave functions themselves. The general idea is 
that the desired function remains, while the 
others are chopped up. The fact that the proba- 
bility of elicitation of one of the 0 * s depends 
upon both the amplitude of the 0 and the level 
of the noise, with reinforcement acting only upon 
the noise, resembles parts of certain theories of 
learning in animals, but this topic too will be 
reserved for more systematic discussion when that 
is possible. 

This paper will conclude with a few brief 
remarks of a more general nature. First of all, 
the mathematical language employed is well suited 
to describe both the fluidity of the initial 
stages of perception and thought, in which ideas 
have the potentiality of being examined on the 
basis of a large number of cross-cutting systems, 
and the final stages in which definite and stable 
relations may be perceived. The language is also 
suited to describe a system in which structures 
are not always defined in advance. As in the 
illustrative example of microwave modes, a source 
of excitation which is only moderately complex 
can interact with a structure which is only 
moderately complex to produce modes which are tre- 
mendously complex, and which could not feasibly 
be constructed in any other way. In addition, 
these structures themselves are given the oppor- 
tunity to become still more highly developed by 
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an evolutionary process of selection and recombi- 
nation. In this sense the model resembles the 
desired arena in -which patterns spring up and 
develop by themselves. In such a system the in- 
formation is stored in the form of incompletely 
defined potentialities, which are realized only 
in interaction with the eliciting and recognizing 
parts of the system. The percepts, for instance, 
are in the combination of stages, not in either 
alone; for in the first stage by itself they con- 
tain all points of view at once and none alone, 
while in the second stage by itself they are in 
the form of the conventional computer representa- 
tion of information, which is no more perceptual 
and subjective than the image on a television 
screen. The full computer might have to contain 
a conventional component for reception and useful 
transformations of the input, followed by the 
present model, followed by a more or less conven- 
tional component to discriminate among the sta- 
bilized images or ideas and produce an output. Of 
course if we were t lying to design a machine -which 
could have a knack for skilled actions, the same 
problems of fluidity of transformations would 
recur, as we may be convinced by noting that our 
signature comes out the same even If we hold the 
pen in the mouth or between the toes. 

Since the information exists only in the com- 
bined input, wave function system, and recognizer, 
one might wonder where the boundaries are between 
the observed system, the observing mechanism, and 
the observer. Von Neumann has shown 1 ^ ^ %he pre- 
cisely similar mathematical situation in quantum 
mechanics; that even though the transformations 
are different in the three subsystems, the com- 
bined result is independent of where the boun- 
daries are drawn. Thus although in this system, 
as well as in the workings of the brain, it is 
impossible to draw sharp boundaries, perhaps that 
is not necessary for understanding the processes 
involved. 

Finally, we have seen that this model makes 
a distinction between items of information which 
can merge and then exert interacting effects in 
new coordinate systems, and items which may be 
sorted and combined but remain- separate and have 
independent effects in any new coordinate systems. 
It is conceivable that the evolutionary process 
of selection and recombination could lead to the 
elaboration of a complicated wave function which 
contained a large body of information which had 
become integrated in a way which is necessary 
for complex behavior. Whether or not such a 
situation could occur depends upon how clever we 
are in designing the proper rules of evolution, 
but the point to be made here is that the 
mathematical language we are using is (as is 
known in quantum mechanics) a good language in 
which to describe such a situation should it 
occur, or for the purpose of trying to make it 
occur. 


1. P. H. Greene, "An Approach to Computers That 
Perceive, Learn, and Reason," Proc. Western 
Joint Computer Conference: 1959 • pp.l8l-l86. 

2. P. H. Greene, "The Fundamental Units of Per- 
ception: An Outline," Air Force Office of 
Scientific J^esearcn- Technical Report, to ap- 
pear in i960. Revision of unpublished outline 
of series of lectures given in the Dept, of 
Psychology, Northwestern Univ., Spring, 1959* 

3. P. H. Greene, "Networks for Pattern Percep- 
tion," Proc. National Electronics Conference. 
Vol. 15; 1959. 

4. J. C. Shaw, A. Newell, H. A. Simon, and T. 0. 
Ellis, "A Command Structure for Complex Infor- 
mation Processing," Proc. Western Joint 
Computer Conference; 1958, and Rand Corp. 
P-1277. 

5. P. Schilder, "Mind: Perception and Thought in 
Their Constructive Aspects," Columbia Univer- 
sity Press, New York, N.Y.; 1942. 

6. A. Lande, "Quantum Mechanics, " Sir Isaac Pit- 
man & Sons, Ltd., London, Eng.; 1951. 

7. A. Land£, "Foundations of Quantum Theory: A 
Study in Continuity and Symmetry," Yale Uni- 
versity Press, New Haven, Conn.; 1955. 

8. A. Land<£, "Quantum Mechanics, From Duality to 
Unity," Amer. Scientist. 47:341; 1959. 

9. N. Wiener, "Nonlinear Problems in Random 
Theory," The Technology Press of The Massachu- 
setts Institute of Technology and John Wiley 

& Sons, Inc., New York, N.Y.; 1958. 

10. N. Wiener and A. Siegel, "A New Form for the 
Statistical Postulate of Quantum Mechanics," 
Phvs. Rev .. 91:1551-1560; 1953. 

11. A. Newell, J. C. Shaw, and H. A. Simon, "Chess 
Playing Programs and the Problem of Complex- 
ity, " IBM Journal of Res. and Devt., 2 : 320- 
335; 1958. 

12. J. von Neumann, "Mathematical Foundations of 
Quantum Mechanics," translation by R. T. 

Beyer, Princeton Univ. Press, Princeton, N.J.; 
1951. 



163 

5.3 


Table 1. Interdependence of the Arguments 


Symbols as modes of oscillation 
Much internal structure. 
“Spring up," so easy storage. 


Superposition and resolution 


i 


Components in various coordinate 
systems represent various points ■ 
of view . 

Gives rule of interconnection and 
dependence . 

Holistic nature of transformations. 


1 


Observations of various types 
Conditional probabilities relating 
pairs of observations provide a 
useful description. 

/ \ 

Selective awareness Transforma tio ns 


Normal modes provide coordinate systems . 
Will be identified with stable percepts. 
Reproducible, integrated, often discrete. 
Units "carved out" by system. 

Relation between two systems' always linear 


KH citation of information: Split into 
one of the coordinate sets of eigen- 
functions and correlate with shot noise . 

Destroys the latent information. 

Representations now amenable to logic at 
expense of richness of interconnections. 

Allows learning by reinforcement which acts 

by stopping the shot noise source. 

Simulates some properties of thoughts 
(primary process) and integrated action 
patterns (instinctive acts) . 

Random selection and recombination can lead 
to elaboration of more complicated symbol 
structures. 

Existence of many possible systems of cate- 
gorizations cutting across one another, so 
that what is definite in one is incompletely 
defined in others. 

One cannot and need not draw a definite line 
between environment and observer. 



~> the group of transformations, but must be their 
squared moduli . 

Then's will be the transformation coefficients 
between coordinate systems specified by dif- 
ferent sets of normal modes. 

Each set of coordinates will contain latent infor- 
mation in the form of phases, which will be 
elicited only in other coordinate systems and 
thus represents potentialities. 


I 


Operator and eigenfunction equations . 

Sharply defined numerical values elicited must 
be eigenvalues. 


v 


Kinds of system which give wave equation for 
change in^P^s with time (or other parameter) , 
E.g., multiple paths, specified by phase lags. 


1 


Abilities for Gestalt perception . 

Pure states or normal modes analogous to "good" 
configurations . 

Whether part of a pattern is seen as a Gestalt 
will often depend upon other parts of the pat- 
tern. Problem of seeing parts in a larger 
pattern. 

Gestalt constancies, such as brightness constancy. 
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Figure 1 Flow Chart of the Model 



Figure 2 

Pictorial representation of the functions of the 
various parts of the model, illustrating rein- 
forcement learning. 


A population of wave functions in the pool 
(depicted by the large objects circulating in the 
tank at the left) are split into components of 
some type (the small circular objects) by the 
appropriate separating mechanism (the grate with 
round holes), although they could have been separ- 
ated into different sorts of components (by the 
other grates). The components contain information 
which enables them to combine again (the fringe on 
the circular objects). The noise which acts as 
the eliciting mechanism is depicted by the pump at 
a height representing the threshold. There is only 
room for one component at a time to pass into the 
pipe leading back to the information pool, so that 
all but the first component to reach that pipe will 
drop down and be discarded. In passing through the 
pump the latent information allowing recombination 
has been destroyed (the circles are now smooth). 

The man represents the operator of the machine or 
another part of the machine which can observe the 
action represented by the component which has been 
elicited. To reinforce that response the man turns 
off the pump; if another component has come through 
instead, he merely leaves the pump on for a while. 
He acts only on the pump; the elicited component 
returns to the pool in either case. 









165 

6.1 


ANALOG COMPUTER TECHNIQUES FOR PLOTTING 
BODE AND NYQUIST DIAGRAMS 

G. A. Bekey 
L. W. Neustadt 

Members of the Technical Staff 
Space Technology Laboratories, Inc. 

Los Angeles, California 


Summary 


Statement of the Problem 


Two methods of plotting Bode and Nyquist dia- 
grams with an analog computer are presented. Both 
methods have the advantage of being able to give 
the open loop response from a closed loop simula- 
tion on the computer, which makes them applicable 
to systems with open loop poles in the right hand 
plane. Neither method requires any adjustment by 
the operator, other than to change the frequency 
of the input signal. 

The first method makes use of some trigono- 
metric identities, which yield functions which 
identically equal the real and imaginary parts of 
points in the Nyquist diagram. It makes rather 
extensive use of multiplying equipment, but no 
other non-linear equipment is required. 

The second method makes use of diode and re- 
lay switching techniques to measure phase, and 
peak value readers to measure amplitude . One di- 
vider and no multipliers are used, but diodes and 
relays are necessary. 

Nyquist and Bode plots are presented, as com- 
puted by both methods. 

Introduction 

In the study of linear feedback control sys- 
tems by means of analog computers it is frequently 
important to examine the frequency response of the 
simulated system. Commercially available equip- 
ment can be used to obtain amplitude ratio and 
phase as a function of frequency for an open loop 
system. Unfortunately, many practical control 
systems are either unstable open loop, or they 
contain a number of integrations which result in 
considerable drift unless the loop is closed. The 
techniques presented in this paper resulted from 
an investigation of methods for obtaining the open 
loop phase and gain characteristics of the control 
system from its closed loop simulation on the ana- 
log computer. The information can be plotted 
either in Nyquist or Bode form. 

Two alternative methods are presented in the 
paper. One of these makes use of trigonometric 
relations and obtains the coordinates of a point 
on the Nyquist diagram directly. The other uses 
filtering and switching schemes to obtain the co- 
ordinates of a point on a Bode diagram. Both of 
these methods are capable of obtaining the infor- 
mation from a closed loop simulation and make use 
of standard analog computer components. 


I . Open Loop Measurement 

Consider the simple control system illustrated 
in Figure 1 where the forward transfer function is 
indicated by G(s). If the system is stable open 
loop, and has no poles at the origin, it is possi- 
ble to apply signal of known phase, amplitude, and 
frequency to the system represented by the transfer 


function G(s). Thus, if we let 

e(t) = E sin cut (l) 

we obtain 

c(t) = C sin (cut + <&) (2) 

and the open loop gain K is given by 

K(cd) = ^ =|g(Jo>)| (3) 

while the phase is given by the angle where 

0(<n) = arg G(jo)) (A) 


The Bode diagram consists of the quantities 
K(<n) and $(od) plotted vs. log o>. The amplitude 
ratio or gain is generally expressed in db. The 
Nyquist diagram is the complex plot of the function 

z(co) = K(cn) cos (ZS(<n) + j K(<x>) sin G5 (go) ( 5 ) 

Most commercially available "control system 
analyzers" (Fuchsl) make use of these relations, 
and thus assume that the block G(s) is available 
and that it can be supplied with a controllable in- 
put signal. Various analog techniques are possible 
for measuring the phase-gain characteristics in 
this case. Some typical techniques are the follow- 
ing: 

(a) The in-phase and out-of-phase components 
can be evaluated by nulling the in-phase and out- 
of-phase components of the output with manually at- 
tentuated sine and cosine voltages. This technique 
is described by Larrowe^ . 

(b) The in-phase and out-of -phase components 
of the output can be obtained from calculation of 
the respective Fourier coefficients of the output. 

(c) A variable phase shifter and a filtering 
technique can be used, as described by Cowley3. 

This method relies on careful control of phase and 
amplitude of the input signal to the network in 
question. 

(d) Perhaps the most commonly used, and crud- 


^Numbered superscripts refer to references at the end of the paper 
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est method possible, is the comparison of input 
and output waves on adjacent traces of a strip 
chart recorder. Clearly, the measurements obtain- 
ed by this method are approximate at best. The 
phase measurement can be refined through the use 
of Lissajous figures with either an oscilloscope 
or an x-y plotter. 

In the literature, these methods are describ- 
ed for open loop operation only, and some require 
extensive modification for closed loop operation. 

II. Closed Loop Measurement 


Consider the block diagram of Figure 1 once 
again. It is desired to obtain the phase and gain 
characteristics of the transfer function G(s) from 
a closed loop simulation on the analog computer. 
The requirements for a satisfactory frequency re- 
sponse measuring scheme are taken to be the follow- 
ing: 

(a) It must be capable of obtaining the open 
loop phase and gain characteristics of the system 
without breaking the loop. 

(b ) It must be capable of mechanization with 
standard computer components. If 

r(t) = R sin cot (6) 


then we can write the following steady- state rela- 
tions : 

e(t) = E sin (cot - 9) (7) 

and 

c(t) = C sin (cot - 9 + 6) (8) 


The desired amplitude ratio is given by 

= ifey = | G(ja3) 


(9) 


and the phase shift by the angle $(ca) = arg G ( joi) . 

There are several possible approaches to this 
problem. Two approaches will be described in this 
paper while a third one is described in Reference 
4. 


Trigonometric Method 


The method described below yields a Nyquist 
plot by making use of some simple trigonometric 
relations . 

I. Open Loop Computation 


Let the input to a network be sin cot, and let 
its steady-state output be y(t) = K sin (cot + 0). 
Let us assume that all the poles of G(s) have neg- 
ative real parts. Then in the steady state 



y(t) = A sin cut + B cos cot 

(10) 


- = A cos cot - B sin ait 

(11) 

where 

A = K cos 0 

(12) 

and 

B = K sin <# 

(13) 

From 

(10) and (ll), we have 



A = y(t ) sin cot + — — cos cat 
co 

dM 


B = y(t) cos cat - - sin cat 

co 

(15) 


The Nyquist plot is given by z(co) = A(co) + j B(co). 

Figure 2 illustrates the set-up which was 
used. A network with transfer function 
co Q 2 

G(s) = — g > ( with “b = 1 4 = °* 2 ^ 

s^ + 2c,(a 0 s> + cq-, 

was simulated. The output was plotted on an x-y 
plotter for discrete values of co, and these points 
are shown on Figure 3> with the theoretical curve 
given for comparison. For each frequency the 
value of co was set into the servo to position the 
three pots, and after the transient died, out the 
values of A and B were recorded. Although elec- 
tronic multipliers were used, servo multipliers 
could have been used just as well, if A and B had 
been evaluated in the "hold" mode (Relations (l4) 
and (15) are identities). 

Note that y(t) is easily available whenever 
the denominator of G(s) is of higher order than 
the numerator. If numerator and denominator have 
the same order, y(t) may be obtained from a "90° 

00 

phase shifter", e.g. a network whose transfer 
function is s-co . 

S+CD 

II. Closed Loop Computation 


If G(s), the open loop transfer function, has 
poles on the imaginary axis or in the right hand 
plane, there is no steady state and the open loop 
response must be computed with the loop closed. 

We have the following closed loop relations 


(assuming the input is sin cut, the output is c(t), 
and the transient is zero): 

e(t) = sin cot - c(t) = E sin (cut - 9) (l6) 

■ = cos cut - ^ = E cos (cist - 9) (17) 

as as 

c(t) = KE sin (ast - 9 + ti) (l8) 

= AE sin (ast - 9)+ BE cos (ast - 9) 

where 

A = K cos 0 (19) 

B * K sin <2S (20) 

c fo.) - pi e cos (ast - 9) - BE sin (ast - 9)(2l) 

By combining the above relations we obtain 

AE 2 = c(t) e(t) + m (22) 

BE 2 = c ( t ) e(t) ( 2 3 ) 

E 2 = [e(t ) ] 2 + [^p-] 2 (24) 


Again the Nyquist plot is given by 
z(os) = A(os) + j B(os). A computer diagram for 
solving equations (l6), (17), (22), (23), and (24) 
is given by Figure 4. The solution for discrete 
values of os is shown in Figure 5, together with 
the theoretical Nyquist plot. The transfer func- 
tion G(s) was chosen as (l + ts)/s with t = 0.55. 

The same comments relative to electronic 
multipliers vs. servos, and computation of c(t) 
explicitly apply to the closed loop and open loop 
simulations. 
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Although this method appeared suited for 
drawing continuous Nyquist plots, by letting 
vary continuously in the sine wave generator, at- 
tempts at this were not successful. 

The advantages of this method are the follow- 
ing: Standard elementary analog circuits are used. 
Wo "peak value" readers, sampling circuits, or re- 
lay logic circuits are required, since the values 
A and B are computed for all values of t. With 
further investigation this method may therefore be 
adaptable to a continuous plot. Wo resolver is 
necessary since instead of amplitude and phase, 
the real and imaginary parts are computed. Fur- 
thermore, no adjustment by the operator, other 
than variation of frequency, is required. The 
main disadvantage lies in the amount of multiply- 
ing equipment required - particularly in the 
closed loop calculation. 

Switching and Sampling Method 

The method described below yields phase and 
gain information directly, for plotting a Bode 
diagram. The computation of gain and phase are 
done separately. The method is described only for 
closed loop computation. It can be applied to 
open loop calculations with obvious simplifica- 
tions . 

I. Amplitude Ratio Computation 


Thus, the integrator input is given by: 

(”*i(t) = c'(t) for e(t) > 0 , 

J^i(t) = 0 for e(t) < 0 

This wave form is shown in Figure 7e. 

(b) A 90° phase shifter is employed to ob- 
tain a control signal p(t) equal in amplitude to 
e(t). The phase shifter is the same used in the 
previous method, i . e . 


G (s) 
ps 


S - CD 
S + CD 


(c) When e(t) becomes negative, the inte- 
grator output is sampled for the following quarter 
cycle, while the second control signal p(t) remains 
positive . 

(d) When p(t) becomes negative, it provides 
a control signal to reset integrator 1 to zero. A 
quarter-cycle is provided for the reset operation. 

(e) As soon as e(t) becomes positive, the 
series of operations is repeated. 

The waveforms appearing at the output of the 
integrator are given in Figure "Jf . Since the sam- 
ple-and-hold circuit only samples the final value 
of the integration, its output remains constant 
after steady-state is attained. From an examina- 
tion of Figures 7b and 'Je, it can be seen that the 
integrator output reaches a value proportional to 
the area b minus the area a. Wow, if the integra- 
tor input has unity amplitude, it can be seen that 


The gain is obtained by measuring the ampli- 
tude of the input and output signals and dividing. 
The measurement can be done in a number of ways, 
two of which are shown in Figures 6a and 6b, re- 
spect ively : 

(a) The two signals are full-wave rectified 
and filtered. The resulting dc voltages are di- 
vided. (See Figure 6a) 

(b) "Peak reading” circuits can be used to 
read the positive or negative peaks of the two 
signals. The gain is then given by the ratio of 
the peak values. (See Figure 6b) 

For Bode diagram purposes, the resulting sig- 
nal can be expressed in db, as shown in Figure 6 
with a function generator. 

II. Phase Measurement 


The phase shift is measured by using diode 
and relay switching techniques. Consider the 
waveforms shown in Figure 7- Typical waveforms 
for e(t) and e(t) for a lagging phase shift are 
shown in 7a and 7b- If these two sine waves are 
converted into equal amplitude square waves, we 
obtain Figures 7c and 7d. The phase shift can be 
measured by evaluating the shaded area indicated 
in 7b, as a fraction of the total area for one- 
half period. The area measurement is done with an 
integrator, as shown schematically in Figure 8. 

The following comments refer to Figure 8: 

(a) The error signal e(t) is used to control 
a switch selector system. When e(t) becomes posi- 
tive the output signal c(t), converted to a square 
wave c'(t) of known amplitude, is applied to inte- 
grator 1. When e(t) becomes negative, the output 
signal is disconnected from the integrator. 


b = 


CD 

B - 


( 26 ) 

(27) 


The final value of the integration is then 

ILzJi. ( 28 ) 


I = b - a = 

and thus, the phase angle 0 is obtained from the 
relation 

j Icd - it 


(29) 


This computation; is performed by means of the 
circuitry shown in the block diagram of Figure 8. 

Similar reasoning applies to the leading phase 
shift situation, in which case the area a in Figure 
7e is positive instead of negative. This polarity 
change can be used with additional logic to indi- 
cate lead or lag. 


III. Advantages and Disadvantages 


This sampling method has the clear advantage 
of requiring no complex equipment such as multi- 
pliers for phase determination. The computer does 
not need to be reset to observe changes resulting 
from variation of frequency or system parameters. 
Since relays and diodes are relatively inexpensive, 
a special purpose computer for this purpose could 
be constructed. Theoretically, the method can be 
extended to continuous variation of frequency. 

The application of this method to the computa- 
tion of phase and gain for a simple system, with 
transfer function G(s) = (l + ts)/s2, resulted in 
the curves of Figure 9 and Figure 10. In these two 
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curves the experimental data are given by plotted 
points and the theoretical curves are drawn for 
comparison. 


Conclusion 


The two methods described above show that it 
is practical to obtain Bode and Nyquist informa- 
tion on the analog computer from closed loop simu- 
lations . 
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TWO WAYS OF DETERMINING AMPLITUDE RATIO 
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Figure 7 

STEADY-STATE WAVE FORMS IN SAMPLING METHOD 
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ON THE REDUCTION OF ERROR IN CERTAIN ANALOG COMPUTER 
CALCULATIONS BY THE USE OF CONSTRAINT EQUATIONS 

Robert M. Turner 

Lockheed Aircraft Corporation - Missiles and Space Division 
1123 North Mathilda Avenue, Sunnyvale, California 


This paper describes a method of reducing 
the error in certain analog computer calculations 
by the negative gradient technique vhere one has 
additional information about system performance 
in the form of constraint equations. In general, 
the technique leads to correction terms which are 
introduced into the system differential equations. 
In some cases, however, simulation is obtained by 
operating directly with the constraint relations 
themselves . 

The theory is developed and several ex- 
amples discussed at some length in order to show 
both the manner in which the corrections may be 
introduced, and the character of the corrected 
solutions . 


Theory 


Let the equation set I represent the system under 
investigation; i.e., it is the basic differential 
equation set describing the physics of the situa- 
tion. Since any nth order differential equation 
may be written in the form of n first order equa- 
tions no generality is lost and considerable con- 
venience gained by adopting this presentation,. 

The x r are independent or driving functions. 


y 1 -f 1 (x 1 ,x 2 ,...x p ;y 1 ,y 2 ,...y ii ) 

*2 " t 2(VV-V y l ,, 2'- , n ) 


K ‘ V W • • -V y i’ y 2> • • -V 


Let the equation set II describe the constraints 
in (or to be imposed upon) the physical system I. 


i 1 (x 1 ,x 2 ,...x pJ y 


1* J 2 J 


■v y 


yo*-*-y n ) 




II 


€ q * € q< X l> V * * * V*l'*2' * * ' y n jy l' y 2' * ' 


The set II is written such that e . equals zero 
for all j . Now since the x are considered to 
be independent variables or driving functions, 
any deviation of the e. from zero must be due to 
incorrect values obtained from the y^ in the sol- 
ution of set I, (where u *= 0 for y^ and 1 for y i ) . 


where 


* XL (O' 


( 1 ) 


j=*i 


Now for any given value of x^ and y^ we may find 
the change required in each of the y. m a k ing up 
E by considering E as a surface in Euclidean N 
space. The vector givixg the direction of great- 
est change in E is the gradient vector, VE. 

Since we are interested in the direction of great- 
est decrease of the function, we turn our atten- 
tion to the negative gradient, which has the 
components 

<1 * 

r oe. u = 0, 1 . . 

e, — for | (2) 

j=l J 


1,2,... n 


u 


u 


Then we may assign to each Ay^, where Ay. is 
defined as the correction term to be added, to the 
y. obtained from the solution of I, the value 


u 

Ay ± 


<1 

- Z 

>1 


Vj 




*i 


for 


0, 1 


1,2, . . .n 


(3) 


That is, we add to each y a value proportional 
to the component of -VE in the y. direction. K. 
is a weighting factor determined oy the importance 
attached to the particular e.. Depending upon the 
process of gradient determination and subsequent 
correction insertion one has either a true grad- 
ient method or steepest descent method. For the 
applications in this paper, the gradient is de- 
termined and the corrections added continuously. 
Thus no sequential series of steps is taken^a^ong 
one gradient before the next is determined. ’ 


If we add these corrections directly and contin- 
uously to the positional values of the y^, each 
member of the corrected differential equation set 
becomes 


y‘ = f 1 [x 1 ,x 2 ,...x :p ;y 1 ,y 2 , ...y^] + Ay ± 

7 i - J y]dt + Ayj. 

where y\J is the corrected rate term. 


w 


Note that Ay i is not the integral of Ay^ as each 
arises independently from ( 3 ). We call this the 
Ep correction of I. 


Since we wish to consider all the e. simultaneous- 
ly, we take as our optimization criteria the min- 
imization of a non-negative function E of e . 


This method is in general quite difficult to im- 
plement on an analog computer due to the formation 
of algebraic loops in the generation of the error 



(1-5) 
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terms. We may, however, take a different point 
of view and consider each to arise from a 
change in the defining rate of y^; i.e., we put 


dH^i) = 

3-1 




W 


for 


u * 0,1 

i -1,2, 



This also will give to each y^ a correction in the 
desired direction hut in quite a different mann er 
due to the integration process. Thus each member 
of the corrected differential equation set becomes 

y± ~ v • • V y i ,y 2' * * * y n^ + ^i tfi&i 

since when u » 1 in (5) a second derivative term 
in Ay^ arises which cannpt be inserted in y. with- 
out increasing the order of the system. We call 
(6) the E^ correction of I. 


For certain algebraic applications (see Example l) 
the system order is indeed increased, but in gen- 
eral., this is not desirable. Direct positional 
insertion may be used, however, so that both E— 
and E_ corrections are applied in the same system. 
In this case, (6) becomes 

y ± ** f i (x 1 ,x 2 ,...x p ;y 1 ,y 2 ,...y n ) + 4ar ± + &y ± (j) 
Example (4) considers this in some detail. 


Simply stated, equation (3) shows that corrections 
may arise for all functions and their derivatives 
which occur explicitly in the constraint equations. 
Equations (4), (5), (6) and (7) show how the cor- 
rections can be added to the system I. 


We shall next illustrate Ep and E_ corrections as 
applied to certain problems of a general nature 
aris in g in analog simulations. The Ep simulation 
is visually quite easy to obtain from the original 
system mechanization, since in general it only 
involves adding the correction terms into the 
several integrators. This, of course, enables 
one to compare corrected and uncorrected perform- 
ance quite readily. E correction on the other 
hand is not so straight forward, and as previously 
noted, generally gives rise to stability problems 
on analog equipment. 


Examples 

Example 1 

Consider the desired calculation 

y± -Jk. (1.1) 

X 1 

Utilizing the zero order constraint, form 

y lV X 2 “ e ( x -2) 

Taking 

y l * y l(°) + ^1 0-.3) 

Differentiating and applying (5) 

*1 “ -it<V " A«i ^ - -VA 


Thus for quasi- static x^, x p 


x X 1 




For x 1 > 0 we note the possibility of incorpora- 
ting It into the constant K^. This saves one 
multiplication; i.e., put 

JL (Ay-,) * _K n e i for x > 0 (1.6) 

dt x ^ x - L 


Table I shows the type of convergence to the cor- 
rect solution for several cases where x ■» x . 
Only the exponents are shown since the solution 
form is the same as (1.5). Note that for the 
case x 1 ** x * sin cot (1.4) will not give correc- 
tion until t becomes large enough to offset 
sin 2 wt . 

2w 

Further investigation of the. (non-linear) equa- 
tions (l.4) and (1.6) is required to establish 
convergence criteria for arbitrary and x„ . 
Equation (1.4) is mechanized in Figure (1.2; with 
the aid of the symbols in Figure (l.l) . 


TABLE I 
Using (l»4) 


Using (1.6) 


x^ and Xg constant: 

-w 

» Xg s t: 

-K^/5 

-YL^t 2 /2 

x l * Xg bs sin orb; 

K^ + _ sin 2 wt', 

~2~ 2b) 

Not applicable 
since x. is not 
always > 0 

X l“ x 2 “ A + sljl 

For A > 1 

-KJA t + t/2 + 2A/b) 

- K 1 [At+ i-( 1-cos wt) ] 

-2A/w coswt-l/4w sin2urtj 

In Figure 1.2 G is to be 

as large as possible 


since it effectively increases K, .x^,,sx ,N^ and N g 
are to be taken as quasi-static where N. and N p c 
are errors which exist at the points indicated. 
The instantaneous differential equation is 


*1 - -V‘l + A 31 ! + *2 


(1.7) 


which yields the instantaneous solution form 
y x - y 1 (0)e- K I 3, ? t + [^ Kl-e^ (li? 


Attenuation of the error through 
multiplication channel is as 5— 
second by . 1 

*A 


the first 
and through the 


Because of this noise attenuation it becomes 
practical to use division in forming special 
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functions; for example: 

Given y to form z * In y we note 
z « -i- 

y 


Note that 


R = x cos cp + y sin cp 


(3 .4) 


(1.9) 


Thus e, and e p enable us to find sin cp and cos cp; 
(3.4) determines R. 


Here z is formed by the division circuit, then 
integrated to give z. The method is useful pro- 
vided z is used ultimately in the equation for 
y (i.e. , the z integration is not open loop). 

A particular advantage obtains in that the range 
of the In function is determined merely by an 
initial condition on the integrator. 

Notice that when (1.6) is applicable, two independ- 
ent Xg/x 1 and xVx^ divisions are obtainable 
with one multiplier. For either (1.4) or (l.6), 
y is available, though greatly attenuated and 
noisy. 

Example 2 

Inverse function generators in the feedback loop 
of high gain amplifiers are frequently used to 
generate monotonic functions having steep slopes. 
The method is advantageous because of the re- 
ciprocal relation between the slopes. Approach- 
ing this problem from the point of view of a con- 
straint relation yields the following. 


Using (5) 

d 

dt 


(A cos cp) ~-2K.j_e.j_ 




O cos Cp 


5 -Ke. for R > 0 

d (A sin cp) e*-2K, e p d £ g 
dt 1 a sin cp 


(3-5) 


(3-6) 


* -Ke- for R> 0 

(which are mechanized in Figure 3*l) 

The previous examples have illustrated methods 
of using the system equations themselves to form 
error (or constraint) equations. Minimizing 
these led directly to a solution of the system 
equations . 

We shall next consider two examples of systems 
which are amenable to error correction in the 
more general sense of constraining the solution 
of I to satisfy the relations in II by the use 
.d/or Ep corrections. 


of Ep am 


Let y 
Now 


f(x) be the desired function, given x. 


f(y) 

Using (5): 


and e « f (y) - x 


be 


iL (Ay) - y ■ - Ke ”3 y 

dt 


( 2 . 1 ) 

( 2 . 2 ) 


r — t -1 2 

x we see that f( y ) *= y and 


Example 4 

Amplitude stabilization of the harmonic oscilla- 
tor using essentially Ep type correction has , 
been treated extensively in the literature 
However, as this second-order system affords a 
particularly simple vehicle for obtaining quali- 
tative solutions which aid in understanding the 
roles of the correction terms, it will again be 
considered here. To this end, then, let it be 


y *= -2Key; y(0) « ^x(O) 

(2*3) 

desired to generate the sine and cosine of 
angle cp where $ is given. 

the 

2 

e « y - x 


(2.4) 

y 2 ■ sin cp -► y 2 = <p cos cp 

(4.1) 

Performance is good except 
mechanized in Figure 2.1. 

near x = 0. 

(2.3) is 

y.j_ = cos cp -*■ y^j_ = -(f) sin cp 

(4.2) 




Thus the system equations are : 


Example 3 



£ 

H 

(M) 

The problem of finding R, sin cp and cos 

cp given 

y-, ~-$y 0 

&*) 

x and y frequently arises in coordinate 

trans- 


formations. When <p is not 

available and the 

The constraint relations we adopt here are: 


frequency too high for standard circuits, the 

2 2 

(^•5) 

following method has been used with success. 

e l “ y l + y 2 ' 1 

Since 2 2 2 



and e 2 « y^ - y^ - 4> ■ ^ 

(4.6) 

x + y « R 





x *« R cos cp 


(3-1) 

e 2 is a measure of system rate error, since 

y m R sin <p 



f tan" 1 y 2 U y l y 2„- y 2 y l - <j> 

dt L -7iJ 

(^•7) 

we put 


(3-2) 

€jj_ « R cos cp - 

X 

Thus . 


e 2 «= R sin 9 - 

y 

(3.3) 

y 1^2 ' y 2 y l * ^(^i + ^ 

(4.8) 
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and 

y l^2 “ y 2^1 “ $ * $ e i 

• (M) 

Using 

to obtain Ay 1 and /sy^: 



&1 “ ~ SK oi e ih 

( 4 . 10 ) 


^2 “ " 2K C)2 € l y 2 

(4.11) 

Using 

€p to obtain “id Ay„: 



" *11^2 

(4.12) 


^2 “*12^1^1 

(4.13) 


The weighting functions are deliberately identi- 
fied in the manner shown in order to determine 
the effect of mismatch; always an Important con- 
sideration in analog work. 

Using (7), the corrected system equations become: 


y 2 * 4^1 + A/ 2 + A/g (4.14) 

y x *= q>y 2 + (4.13) 

Carrying out the work assuming quasi-static e 1 
(which amounts to treating it as sign invariant 
during the instantaneous integration process) 
yields 


y 2 » Ae -0 ^ sin wt 
y 1 *» e -0 ^ sin (art + g>) 

where 

A = 1 + (^ii " ^12^ 

Q “ ^ K 01 + ^2^1 
w - 4>[1 " + K^)] 

cp *= tan ■*" 

e l^ K 02 " **01 


(4.16) 

(4.17) 

(4.18) 

(^.19) 

(4.20) 

(4.21) 


and where second-order terms have been dropped. 

An analysis of (4.l6) and (4.17) shows that the 
system process is such as to decrease the absolute 
value of e, . (4.18) and (4.21) give the detri- 

mental effects of mismatch. Note that damping 
arises only from El, and rate correction (to first 
order terms) from E p . 


The mechanization of (4.3) and (4.4) with E_ 
corrections arising from e-j^ is shown in Figure 4. 1 . 
Stable frequencies in excess of 150 cps have been 
generated using this technique. 

Example 3 


This final example will treat with the compound 
pendulum of Figure 5.1 having a total angular 
excursion of * j/2 radians. The non-linear 
differential equation describing this system 


may be’ solved by elliptic integrals. 

Although quite simple to mechanize on an analog 
computer, the errors after 5 seconds are pro- 
hibitive for a pendulum having a 5 cps frequency. 
The introduction of two constraint relations is 
necessary to preserve solution integrity. 



Figure 5*1 


m * mass of pendulum 
1 * length of pendulum 
0 * point of suspension 
C m center of mass 
k * radius of gyration 
about 0 
a * 0C 

mg* force due to gravity 


Parameters and Definitions 


m * 1 gram 

A = 

. ga 
ko^ 

1 = 1.2 cm 

B = 

1/2 mk^ 

a = .346 cm 

G * 

mga 

k Q 2 * .2394 

y 2 = 

sin 9 

2 

g * 980 cm/sec 

y r= 

cos cp 


These parameters give a 5 cps frequency for an 
excursion of + it/2 radians. Initially the pen- 
dulum is at cp * + tc /2 which is defined as the 
point of minimum kinetic energy. 

The energy equation is 

B$ 2 + G(l - cos q>) » G (5-l) 


Differentiating this leads to the system equations 


9 at to 

w =-Ay 2 

where y g and 9 are related by 

y 2 * 9y x 
*1 =-q>y 2 


(5.2) 


(5-3) 


y is the driving function for (3.2). Any errors 
arising in w and <f> (we do not employ q> in the 
mechanization) arise from the product Ay g and the 
subsequent integration. 


9 is the driving function for (5*3) . We know 
that amplitude stabilization of (5*3) is desir- 
able, but here (5*l) imposes yet another con- 
straint upon the y^ formed in (5*3) • 

Using both the total energy and the amplitude 
stabilization relation, we define 



e i - A + 4 - 1 
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(5-4) 

e 2 “ B<p^- Gy 1 (5-5) 

from which one obtains (using 3) : 

Aj *s 0 

Aw * Aq> e-aK^Bq) (5*6) 

Acp * - K 2 e 2 y 2 G 


Ay 2 » Ay 1 » 0 
Ay 2 ='2K 1 e 1 y 2 

Ay l ,= ' 2K L e l y l + * 2 * 2 ° 


Using (7): 

4> * w + Aq) + A4> 
w = -Ay 2 + Aw 


y 2 = ^ + z^ 2 

y l~ -^2 + 


(5-7) 


(5-8) 


(5-9) 


Note that an attempt to include either Ap or Af> 
in the <f> equation leads to an unstable algebraic 
loop since e 2 contains <p. 


factors as large as possible in order to hasten 
the (less efficient) minimization process. 

Thus, except in simple systems where no incon- 
sistency in units arises for the correction terms, 
we shall follow this rule. 


This has been done for (5«10) and (5 .11) which 
are mechanized in Figure (5 *2) .Deviations from 
the theoretical solution were monitored for a 
variety of cases and the results are given in 
Plates I and II. 

Plate I shows the results after 5 seconds of 
solution time of 

1 . No correction 

2. Amplitude correction into the cosine only 

3. Amplitude correction into the sine only 

4. Amplitude correction into both the sine 
and cosine 

5. Correction in the potential energy only 

6. Correction in the kinetic energy only 

7. Correction in both kinetic energy and 
potential energy. 

All of these solutions had errors growing with time. 

Plate II gives the results of applying corrections 
arising from both e, and e 2 . All of these solu- 
tions are stable, but notice that they may be 
substantially in error unless the corrections are 
added as specified by (5 -10) and (3.II). 


With this restriction, the corrected system is 


<p «s w 
w «^A y 2 - 


( 5 - 10 ) 


j 2 = <f> yi - ec^yg 

y l~ "$ y 2 ~ 2K l y i e l + K 2 e 2 G 

Any attempt to implement (6.10) and (6.11) leads 
immediately to scaling difficulties due to the 
disparity of B and G. Further, the units of aw 
and A5, make impossible a consistent definition 
in the units of K^. Actually, this is to be ex- 
pected for our treatment of K, has neglected any 
mention of u and i which certainly must be con- 
sidered if any rational system of units is to be 
maintained in the several corrections. 


The dilemma confronting us is quite fundamental, 
for if the K. are to be determined by a consider- 
ation of i add u as well as j, what are to be the 
weights attached to them? Unless this is known, 
the corrections will not lie along the negative 
gradient as desired. We do know, however, that 
these newly considered constants will all be 
positive. Thus, no matter what weights are 
attached to them, the resulting correction vector 
will decrease E though certainly not in general 
along its negative gradient. For small E this is 
probably not significant, but in any case the 
remedy seems to be to make all the K. and their 

J 


In accordance with the theory of keeping K. as 
large as possible here, it was noted that decreas- 
ing L or A degraded system performance. 
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Hans F. Meissinger 
Hughes Aircraft Company 
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Summary 


A new computer technique is described which 
yields the partial derivatives of problem vari- 
ables with respect to pertinent system parameters 
simultaneously with the solution of the original 
system differential equations. These derivatives, 
known as parameter influence coefficients, are 
valuable to the analyst in enhancing his under- 
standing of system characteristics. If the prob- 
lem solution x(t,A,) and the parameter influence 
3x 

coefficient ^-(t,X) is known for a particular 

operating point where \ = X Q , then it is possible 
to make a first order prediction of system behav- 
ior at a neighboring point having the new param- 
eter value = X Q +^X. Similar predictions can 
be made if not one but several parameters are to 
be varied. Thus, the knowledge of parameter in- 
fluences often helps to reduce the total number of 
computer runs required in a parametric system 
study. Typical applications of the technique are: 
linear extrapolation in the neighborhood of a 
known solution; determination of design tolerances 
of a system; prediction of critical parameter 
values and stability boundaries. The mpst useful 
application pertains to systems disturbed by ran- 
dom noise where normally a very large number of 
computer runs would be required to analyze the 
system on a statistical basis in a variety of 
operating conditions. Several illustrative 
examples are presented in the paper. 


Introduction 


In the analysis of dynamic systems it is fre- 
quently required to determine response character- 
istics not only for selected operating conditions 
but rather for a range of conditions over which 
certain system parameters can vary. If the sys- 
tem is described in terms of a set of differen- 
tial equations 

dx i 

dt“ = f iW> X 2* t; a V a 2* **•)* 

i = 1, 2, ..., n (1) 

where a^, a^, ..., are system parameters of 
particular interest, then any information obtain- 
able from the computer which serves to enhance 
one's knowledge of the system response as function 
of these parameters will be valuable. Let the 
solution of the differential equations (l) ob- 
tained for a prescribed set of parameters and ini- 
tial conditions be expressed as 

Xi o =Xi Q (t; G!p Gig, •♦•), i = 2, • ••, n (2) 


Then the partial derivatives 

9xj ^x,* px,- 

±o 1 o • L o 1 

9 a. ’ $ a 0 ' TcT > n 

12 m 

which will be referred to as parameter influence 
coefficients will be of considerable interest. 
They can be used to predict system performance in 
the neighborhood of the known solution xi Q by 
first order approximation and to describe system 
sensitivity to certain design changes. If a 
parametric study of the system is required, usu- 
ally a time-consuming task even on modern elec- 
tronic computers, the parameter influences help 
reduce the total number of computer runs which 
must be made to explore a specified region of 
the parameter space. The situation is analogous 
to the simple task of drawing a curve either 
through a set of given points or through a set 
of points at which tangential slopes are known. 

In the latter case a much smaller number of 
points is needed to perform the task with the 
same degree of accuracy. How many data points 
may be saved through the additional information 
contained in the tangential slopes is a question 
which cannot be decided a priori; this depends on 
the nature of the curve and on the spacing of the 
points themselves. Evidently, the parameter in- 
fluence coefficients give the analyst some degree 
of insight into trends of performance variation 
and can guide him in the search for parameter 
values of particular interest, for example those 
parameter combinations which promise greater sys- 
tem stability, greater yield of a chemical proc- 
ess, or greater effectiveness of control, as the 
case may be. Such orientation is especially de- 
sirable where many parameters of interest must be 
varied, and where intuition fails to indicate a 
clear direction for an experimental search pro- 
cedure. 

This paper discusses a method for obtaining 
parameter influences by solving a set of auxil- 
iary differential equations, known as sensitivity 
equations, simultaneously with the original sys- 
tem equations. These sensitivity equations are 
derived from the original equations by a simple 
differentiation process prior to programming the 
computer. Obviously then, the additional infor- 
mation sought can only be gained at the expense 
of greater program complexity. However, the ad- 
vantages of this trade-off will become apparent 
in the following sections. 

The use of sensitivity equations was pro- 
posed some time ago by Murray and Miller^ in con- 
nection with error analysis of solutions obtained 
by analog computers. The theory developed by 
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these authors has a direct bearing on the subject 
of this article, although the objectives and the 
field of application have been broadened consider- 
ably in the present discussion. In practice, the 
concept of sensitivity equations has remained 
largely unused, perhaps owing to the specialized 
application for which it was originally suggested. 
In an article published recently by Margolis and 
Leondes^ a sensitivity equation is used in devel- 
oping an adaptive control system capable of 
sensing parameter influences for feedback pur- 
poses. 

The purpose of this paper is to indicate, in 
terms of some typical applications, the practical 
usefulness of the parameter influence method and 
to stimulate further development in this direction. 
With the aid of this method a computer study in- 
volving parameter variations can be organized to 
follow a more systematic search pattern than has 
been possible heretofore. The need for an im- 
proved design of computer experiments is being 
felt throughout the computer engineering field. 

In principle, the parameter influence method which 
promises to yield such an improvement is appli- 
cable to digital as well as analog computers, al- 
though it will be presented here only in terms of 
analog operations. Time savings may prove even 
more significant in the digital field in view of 
the relatively large amount of time required for a 
single solution. 

The following list of possible applications 
of the technique to studies of dynamic systems 
will illustrate its usefulness: 

1. Prediction of solutions in the neighbor- 
hood of a known solution by a linear extrapolation 
method. 


2. Determination of parameter tolerances on 
the basis of linear prediction. Detection of 
critical parameters. 

3. Application to statistical studies: 
Evaluation of the influence of randomly distrib- 
uted system constants or initial conditions. 
Extrapolation of results obtained with random 
noise signal inputs. 

k. Optimization of system parameters in 
accordance with specified performance criteria 
using gradient techniques. 

5 . Analysis of solution sensitivity to com- 
puter errors. 

6. Determination of system stability 
boundaries. 

7 . Variation of the time of occurrence of 
specified conditions, variation of rise times, 
settling times. 


8. Solution of boundary value problems in 
ordinary differential equations. 

Although only a few of these applications 
can be discussed in the space of this article the 
computer approach will be evident from the 
examples chosen. The emphasis of the discussion 
will be placed on-underlying mathematical consid- 
erations. Some limitations of the method will be 
noted, and areas requiring further research 
effort will be indicated. 

Derivation of Parameter Influence Coefficients 


The principle of the technique will be ex- 
plained first in terms of a simple second order 
differential equation. It will then be general- 
ized for application to more complex systems. 

Consider the inhomogeqeous linear equation 

in x 


+ + »*>f(t) (3) 

dt 

with specified initial values x(0) = a and 
dx 

tj£-( 0) = b. The solution x is desired for several 

values of the parameter X and may be considered as 
a function of two variables, x = x(t,x). From a 
known solution curve obtained for one parameter 
value X Q a neighboring curve for X^ can b e deter- 
mined by extrapolation in terms of AX = X^ - X Q . 
Thus 


f 3x\ ... 

/3 1 2 3 * 5 6 7 *) 

(3X/X 0 * X + 



Ax 

2 ! 


+ ...(k) 


The number of terms needed in this expansion for a 
satisfactory approximation of ?c(t,X^) depends on 
the magnitude of AX and on the behavior of the 
solution x and its partial derivatives with re- 
spect to X in the region of interest. Only first 
order extrapolations will be considered here. 

The partial derivative 3x/9X, itself a 
function of t and X, is a parameter influence 
coefficient of first order. Other parameter 
influences pertaining to equation ( 3 ) are the 
derivaties dx./9/u , <?x/ 0a, and 3x/3b, the 

latter two terms representing the sensitivity to 
changes in initial values. 


A simple operation performed on the differ- 
ential equation ( 3 ) makes it possible to program 
the problem for simultaneous computer solution 
of x and 3x/3x. Differentiation of ( 3 ) with 
respect to X yields** 


3x3t 2 


+ P 


&L. 

3x3t 



+ x - 


0 


(5) 


* The referenced article makes use of some earlier 

unpublished notes by the author3 in which the 
parameter influence method was discussed. 


** Since x is a function of X as well as t, the 
symbol 3 / d t must be used in place of d/dt in 
formulating equation ( 5 ) • 



183 

6.3 


Interchanging the order of differentiation of x 
and using the notation u = 9x/2x, one obtains 
the differential equation in u 


Proceeding now to the general problem of an 
nth order system of differential equations with 
m parameters a^, ccg, . .., 


9 g u . 9u 


( 6 ) 


whereumust satisfy the initial conditions u(0)=0, 
dxi 

^“•(o) = 0, since the initial values of x and 9x/ 9 1 

do not depend on X. Equation (6) is known as the 
sensitivity equation of the system with respect to 
the parameter X. Similarly, if the parameter 
influence of (A is desired, a new equation in 
v = dx/d/Uf viz. 


32v 


dv 




(7) 


is derived, where v obeys the initial conditions 

v(0) = 0, |~Co) = 0. 


In the simple case considered here the homo- 
geneous part of the sensitivity equations is 
identical to that of (3). The terms -x and 
- 0x / 2 t which appear as forcing functions in 
equations (6) and (7), respectively, provide the 
coupling of the sensitivity equations to the orig- 
inal equation (3). There is, of course, no 
coupling in reverse. Figure 1 shows the computer 
circuits which yield x and u; the two circuits may 
be interpreted as master and slave, respectively. 
Owing to the identical structure of (6) and ( 7 ) 
only one slave circuit is necessary to obtain u 
and v. These functions can be computed in succes- 
sive operation, simply by using either x or 
9x/ 2 t as the coupling term. 


dXi 

d r = f i (x l' *2' 


t ; 0!^, Ci^f . . . ) , 


i - 1, 2, . . . , n (l) 

the sensitivity equation with respect to yield- 

0Xi 

ing the derivative u^ = may be formulated as 
follows 


jL(ti) 

?a k l at / 


2 u ik 9f± 3fi 9f ± 

u lk + ^ u 2k + *”+£a£ 


i = 1, 2, . .., n 


(9) 


The initial conditions are u^ k (0) = 0 provided 
none of the initial values of the original differ- 
ential equation are considered as parameters. 

This formulation applies to linear as well as non- 
linear systems. 


It will be observed that in order to study 
the influence of one parameter a k it is necessary 
to program a complete set of n sensitivity equa- 
tions (9) even though this parameter may appear 
explicitly only in one equation of the original 
system (l). If, for example, Cfo occurs in the 
term fj only, then the j fc h sensitivity equation 
exhibits the coupling term 'Sfj/^Qk whereas 
9fi/9a k = 0 for i ^ j. The remaining sensitiv- 
ity equations nevertheless contain the influence 
of a k implicitly in terms of the variables u^ k , 
u 2k> ••• which establish intercoupling with 
the jbh equation. 


If the initial conditions a and b are the 
parameters of interest it is seen that no coupling 
terms appear in the sensitivity equations. In the 
case of #x/$a = w the homogeneous differential 
equation 


92 w 
9t^ 



+ 


Xw 


0 


( 8 ) 


Q 

with initial conditions w(0) = 1, ^(O) = 0 is 

obtained which can be solved simply by operating 
the original computer setup once more but without 
the driving function f(t) and with appropriately 
modified initial values. 


It should be noted that the use of the param- 
eter influence method is not restricted to linear 
systems. If, for example, the original differen- 
tial equation ( 3 ) is augmented by the term Vx3, 
the sensitivity equation with respect to y assumes 
the form 

^ ^ + 2 v X 2 Z + x5 = 0 (6a) 

where z * 9x/Py . The coupling terms in this 
example are, themselves, nonlinear functions of 
the dependent variable x, and the structure of 
the secondary computer circuit differs from the 
primary one. 


These facts are illustrated by a simulation 
problem involving missile guidance. Figure 2 
shows a block diagram of the system containing a 
nonlinear feedback term F(9;k) representative of 
radome refraction of the line-of-sight' from 
missile to target. The influence of parameter k 
upon guidance accuracy is to be investigated. The 
problem variables V, ^ , 8, y, a t a a and q are 
defined in the diagram; k is called radome error 
slope. It is seen that the computation of the 
parameter influence of k involves only one 
coupling term 9F/dk between the primary and the 
secondary simulation loop, whereas the partial 
derivatives of all problem variables with respect 
to k, i.e., V k , 7*k * ®k> etc., are required for 
completing the computer program as indicated in 
the bottom half of Figure 2. In the guidance 
study depicted here the statistical effect of 
radar noise on missile dispersion at impact is 
analyzed. Since a large number of computer runs 
is necessary to make statistically significant 
predictions of rms-miss for any given value of k, 
a parametric study of the influence of k on miss 
M proves very time-consuming. Using the parameter 
influence technique for determining miss variation 
with k a linear extrapolation from computed sample 
points will result in a substantial reduction of 
time required for conducting this study. 
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From a standpoint of economy of computer 
operation the technique would not seem to be very 
attractive since approximately twice the ordinary 
amount of computing equipment is required to 
obtain one set of parameter influences, 9 x ^/9 0^, 
together with X£. Fortunately, however, a slight 
modification of these circuits can often produce 
other influence coefficients 9 x-[/ 9 aj if the 
parameter ctj occurs explicitly only in one or a 
few of the original equations. “The time saved in 
parametric studies of systems having random noise 
input signals nevertheless may justify this tech- 
nique in spite of increased program complexity. 

A problem of theoretical as well as practical 
interest in any application of the parameter influ- 
ence technique is the possible occurrence of res- 
onance phenomena when the sensitivity equations are 
coupled to the original system of equations 1 . In 
the simple case presented by equations (j), (6) 
and (7) it is seen that the secondary system is 
excited by the output of the primary system having 
the same natural frequency cd = VX The secondary 
system will begin to oscillate in resonance if the 
damping coefficient is small. For systems exhib- 
iting neutrally stable or unstable modes of oscil- 
lation the computer circuits for the sensitivity 
equations can therefore be operated for a limited 
period of time only bevore an overload condition 
is reached. The technique remains applicable 
until this time. However, the following consid- 
erations must be kept in mind: As the amplitude 
of the parameter influence coefficients increases 
with time, the instantaneous values of the orig- 
inal solution x(t) become more sensitive to param- 
eter variations. Furthermore, higher order param- 
eter influences which are subject to the same 
resonance effects tend to become relatively impor- 
tant. Hence, the quality of extrapolation based 
on first order parameter influences is degraded in 
any system exhibiting unstable characteristics. 
This fact will be considered in greated detail in 
the next section. 


Some other typical applications of the param- 
eter influence method merit further discussion. 
Examples considered in this section are selected 
from the topics listed in the Introduction and 
include extrapolation from a known solution curve, 
optimization of system parameters, and determina- 
tion of stability’boundaries by systematic search. 
To simplify the mathematical discussion the second 
order linear differential equation (3) will be 
considered again in these examples; the forcing 
term f(t) will be omitted. Using dot symbols to 
denote time derivatives the system equation and 
sensitivity equation are, respectively. 


x + |UX + Xx = 0 

( 3 a) 

U + |W u + \u = -x 

(6) 


subject to initial conditions x(0) = a, x(0) = 0, 
u(0) = 0, u(o) = 0, where as before u is defined 
as u = 9 x/ ^ 


Extrapolation 

Results of extrapolation from a known solu- 
tion obtained on the computer for the value X, Q are 
shown in Figure 3 as a family of output curves 
with \ as parameter. The diagram also shows 
accurate solution curves obtained by actual param- 
eter variation in the system. Similarly, Figure 4 
shows a family of curves obtained by variation of 
the parameter /W, starting from a selected value 
f^ 0 . In both cases a high quality of approxima- 
tion is observed over a limited time interval 
depending on the magnitude of the parameter 
changes A \ and A/W . The time -dependent increase 
in approximation error is inherent in the nature 
of the first order extrapolation scheme. A simple 
discussion, applying to the case of undamped 
oscillations ( £( = 0), will help to clarify this 
fact. The solution has the form 

x = a cos cut (10) 


Other Applications of the Method 

The use of parameter influences in the anal- 
ysis of statistical problems involving random 
noise disturbances was described in the preceding 
paragraphs. If the initial values or some param- 
eters of the dynamic system are known to be ran- 
domly distributed in a specified manner, a 
prediction of the statistical characteristics of 
the output functions at any instant of time is 
possible on the basis of parameter influence coef- 
ficients. For example, the mean and standard 
deviation of the miss distance of a projectile 
corresponding to random dispersion of launch con- 
ditions may be obtained by a single computer run 
through first order extrapolation from a nominal 
flight path. The accuracy of such a prediction 
will be influenced by the interval over which the 
initial values or parameters are expected to vary, 
by the time of flight, and by the problem sensi- 
tivity in general. Spot checks of the accuracy in 
typical cases will be required. 


where cd = {T. The sensitivity equation with 
respect to cd thus becomes 

u + cd2u = -2 cd x = -2 cd a cos cut (ll) 

and yields the parameter influence 

u = -at sin cut (12) 


A first order extrapolation centered at cdq would 
yield the approximate solution 

x e (t,co) = a cos £D 0 t - at(cD - CD 0 )sin CD 0 t (13) 


where the higher order terms of the power series 
expansion with respect to (cd - CDo) are neglected. 
The error of this approximation involves the terms 


(cd -CDq) 2 35x (cd-cdqP 

2! + doy> y. 


. t^D-CDo) 2 t^O-CD^ 

= -a — ‘-coso^t+a smco^ +• • * (14; 


6 
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The time interval during which the second order 
error term remains within specified bounds is seen 
to be inversely proportional to the frequency dif- 
ference (cd - 0) o ) used in the extrapolation pro- 
cedure. Boundaries of fixed approximation errors 
in a frequency versus time diagram are illustrated 
in Figure 5* 

Parameter Optimization 

An application of the method to optimization 
of system parameters is illustrated by the process 
of matching the output x(t) of a system with two 
unspecified parameters against a known function 
x^(t) in terms of least-square errors. This prob- 
lem arises, for example, if test results obtained 
in the field are to be duplicated by laboratory 
experiment with a corresponding mathematical model 
of the system having adjustable parameter settings. 
Once again, the second order equation (ja) is used 
as model; the parameters [A and X are assumed as 
unknown. 

The optimization criterium is given by 
T 

h( f*,X) * J' fx(t ; ^U,X) - x i (t)] 2 dt (15) 

0 

To minimize this function a gradient technique^>5 
can be employed where the values of ft and X are 
varied in proportion to the respective components 
of the gradient vector 7h(f<,x). This technique 
assures a steepest descent to the minimum value of 
h along a path perpendicular to the contours 
h(fz ,X) = constant in the ^W-X-plane. 

In order to compute the gradient components 
T 

% = 2 f [ x(t ^' X ) " x i (t) ] % dt (16) 
0 

T 

8=2 f [*(t;f*,x) - Xi(t)]|jdt (17) 

0 

the parameter influences 9x/ 9 (A and 0x/0X are 
required. They are obtained as continuous 
functions of time together with the solution x(t). 
The computer simultaneously evaluates the inte- 
grals (16) and (17). The sequence of operations 
consists in a systematic step-by-step variation of 
parameter values fu and X in accordance with the 
gradient direction established at the end of each 
computing cycle. The process converges to the 
desired minimum of h. Figure 6 shows typical con- 
tours and gradient lines obtained in the solution 
of this optimization problem. 


analysis. The technique described here provides 
a systematic search pattern for repeated trials 
on the computer. Predictions based on parameter 
influence coefficients are made before each new 
trial run to assure rapid' convergence of the 
trial sequence to a point on the stability bound- 
ary. Only one parameter is varied in this 
sequence of operations. 

In the case of a damped oscillatory system 
one may proceed by selecting two or more succes- 
sive maxima or minima of a solution curve and the 
corresponding values of the parameter influence 
coefficient available from the computer. Pre- 
dicted variations of these maxima or minima are 
then plotted against the parameter in question, 
and a point, or a cluster of points, of inter- 
section of the different extrapolation lines is 
determined. This intersection designates the 
parameter value at which, according to first 
order prediction, the successive maxima or minima 
would have remained equal. Thus, a first approx- 
imation of the stability boundary is obtained. 
Subsequent trials are made, if desired, to yield 
successively better approximations. It can be 
shown that this step-by-step approach is, in 
fact, an example of Newton's iteration procedure 
and converges rapidly to the desired boundary 
point, under certain conditions regarding the 
initial choice of the parameter value. The nature 
of this convergence will be discussed in the 
Appendix. 

An example of the iterative approach is pre- 
sented in Figure 7 which shows solutions of the 
linear differential equation (j>&) having the 
damping factor (A as parameter. Starting with a 
positive damping value, 0.10, the sequence of 
solutions proceeds rapidly to the vicinity of the 
boundary case where /Ug = 0. The example indicates 
that the first iteration overshoots to a slightly 
negative value of [A ; this error is almost com- 
pletely eliminated in the second step. Figure 8 
illustrates the extrapolation graphically in 
terms of predicted maxima plotted versus^. It 
is seen that the initial choice., fJ -j. = 0.10, is 
corrected by the incremental change A/Mg = - 0.116 
which gives rise to the slightly unstable second 
trial solution shown in Figure 7* 

In general, the successive parameter incre- 
ments Aft to be used for the i th trial run may be 
computed in accordance with the extrapolation 
scheme illustrated in Figure 8. Depending on 
whether the intersection of two or more extrapo- 
lation lines are to~ be taken into account, the 
procedure is formulated as follows 

(is) 

L v n v n+l J 


Stability Boundaries 

The problem of finding stability boundaries 
of a dynamic system in the parameter space arises 
in many phases of control system design and 


for one intersection point 


i 


n y XmaXn ~ x max n+1 ~ | 

|_ N n^l v n " v n+ i J i _ 1 


(19) 


for N intersection points 
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where x maXn , Xmax n +1> etc * denote consecutive 
maxima and v n , v n+1 , etc. the corresponding param- 
eter influence terms. These expressions will be 
derived in the Appendix. 

In systems where two or more parameters 
define the stability boundary the process must be 
repeated for various parameter combinations. At 
each of these combinations the prediction of the 
boundary point is performed in terms of the same 
parameter since the computer program contains the 
sensitivity equation (or equations) for that 
parameter. 

Applications Involving Time of Occurrence 

A type of problem frequently encountered in 
computer applications involves the time of occur- 
rence T of a specified condition and the variation 
of T as a result of parameter changes. Examples 
are: settling time or time of occurrence of the 
first maximum of a transient in the study of con- 
trol system response; time of the nth zero- 
crossing of a non-monotonic or oscillatory vari- 
able; boundary -value and eigenvalue problems 
where a parameter must be adjusted until the 
solution meets a specified end condition. In a 
manner similar to the systematic search for 
stability boundaries described above, the param- 
eter influence technique may be employed to reduce 
the number of computer runs which would normally 
be required in a trial and error procei re. 

Let the problem solution x = x(t,k) attain a 
specified value C at the time t = T. Then the 
equation 

x(T,\) = C (20) 

is an implicit representation of the time of occur- 
rence T as a function of the parameter k. From 
the theory of implicit functions one obtains the 
result 


9x 



at 


which may be directly applied to predict the neces- 
sary change in \ to alter T by^T on the basis of. 
known values of u and x, provided the derivative x 
does not vanish. The variables u and x are 
assumed to be available in the computer program. 

The first order extrapolation represented by equa- 
tion (21) can be used in an iteration procedure 
designed to converge to the desired final value of 
T. 

Conclusions 

Computer programming techniques for obtaining 
parameter influence coefficients have been dis- 
cussed, and applications which illustrate this 
approach to the analysis of system performance 
have been presented in the foregoing sections. 

The new method promises more effective utiliza- 
tion of the computer by reducing the effort 


required in conducting comprehensive parameter 
studies or in searching for optimum parameter 
combinations of a system. Savings in machine 
time and operators' time are realized through a 
greater yield of useful information from each com- 
puter run. The cost of additional program com- 
plexity or increased number of computer channels, 
however, and the greater burden of checkout and 
trouble-shooting in the case of analog operations 
must be weighed against the advantages gained by 
the method. In statistical evaluation studies of 
dynamic systems, known for being extremely time- 
consuming, the method offers definite promise. In 
digital computation, where a search for optimum 
solutions, stability boundaries, etc., cannot con- 
veniently be guided by the analyst without much 
loss of time, a systematic search pattern based on 
parameter influence data is particularly attrac- 
tive. Development along these lines is currently 
in progress. In view of the limited experience 
gained in applying the method to examples of the 
type described above, a conclusive evaluation of 
net advantages cannot be made at this time. 

Judging strictly from a viewpoint of time 
savings, it appears that the analog computer 
field can profit more immediately from increased 
operating speeds available in certain types of 
computer models. Repetitive analog computers, 
operating at rates of up to 50 solutions per 
second, are capable of handling statistical evalu- 
ation studies in a remarkably short time. It is 
expected, nevertheless, that parameter influence 
techniques will find increased application on 
analog computers of the conventional real-time 
variety which are available in much larger 
numbers . 

Mathematically, the question arises whether 
parameter influences cannot be equally well 
obtained without increased program complexity by 
a finite difference approach. Such an approach 
requires pairs of computer solutions in close 
vicinity to approximate the partial derivatives 
in question. This procedure is feasible, if 
somewhat more time-consuming, except in the case 
of random noise studies. Here the finite differ- 
ences obtained from two adjacent samples of runs 
would be made highly inaccurate by the confidence 
intervals attached to the individual results. 

The question of extrapolation accuracy, dis- 
cussed in the sections on Derivation of Parameter 
Influence Coefficients and Other Applications of 
the Method, is of fundamental interest and must 
be considered in each individual application. In 
theory, higher order parameter influences could 
be used to reduce the error inherent in first 
order prediction, and these terms could be 
obtained on the computer in a manner similar to 
that described for the first order influence 
coefficients. In practice, this procedure leads 
to a prohibitive increase in program complexity 
and is not considered justified since accuracy of 
prediction is not the primary purpose of the 
approach. 
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Appendix 


Convergence of the Iterative Method for Finding 
the Stability Boundary of an Oscillatory System 


The iteration procedure, illustrated by 
Figures 7 and 8, for finding stability boundaries 
will be investigated in this Appendix, and cri- 
teria for its convergence will be discussed. Equa- 
tions (18) and (19) which give the incremental 
change A/-^ of the parameter f. A after the (i-l) st 
trial will first be derived. The predicted 
(J. -value at which two consecutive maxima, Xm aXn 
and Xmaxn+l, become equal is found at the inter- 
section of the two lines of extrapolation 

— I 

= ^max + | fy*- 

x n+l = ^maxn+i + 0^1 J ty* (Al) 

Hence, by equating these expressions and using the 
notation 9x/2^+ = v, one obtains 

x max n + v n A - x max n+ ^ + v n+l A (* (A2) 

from which the equation 

as) 

rX L v n “ v n+! Ji.! ' 


immediately follows. If more than two consecutive 
maxima are used in the extrapolation, an average 
of the predicted (distinct) intersection points is 
formed which yields equation (19) as a generalized 
form of (18). For linear oscillations of the type 
depicted in Figure 7 it is advantageous to use 
minima as well as maxima in the extrapolation pro- 
cedure in order to obtain twice as many sample 
points from each computer run. The minimum Xmin^ 
and the corresponding slope v^ are used with 
appropriate sign reversals when a comparison with 
the preceding maximum is made. The algorithm 
equivalent to equation (18) thus takes the form 



+ Xminn+il 

+ v n+l J 


i-1 


(A3) 


where the subscripts n, n+1, etc., refer to con- 
secutive extreme values irrespective of their 
character as maxima or minima. This modified 
prediction formula was used in obtaining the 
results shown in Figures 7 and 8. 
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The nature of the iteration procedure can be 
more readily interpreted after introducing the 
simplifying notation 


a n “ x max n “ Xmaxn+l 


m 


For any given subscript n the difference term a n 
is a function of fJ. only. Using this definition, 
a simple expression for the slope difference 
v n - v n+ i can be derived, viz.. 




n+1 



( x max n “ x max n+1 ) 



Hence, the prediction formula (18) may be 
rewritten in the form 


(A5) 


A/^i = 



J i-l 


(A6) 


which is Newton's familiar iteration formula for 
finding the zero of the difference function a n (yu). 
Thus, the successive extrapolation for finding 
the stability boundary point a n = 0 proves to be 
nothing else but an application of Newton's iter- 
ation procedure. 

Figure 9 shows typical a n -curves plotted 
against (A for n = 1, 2, 3 and A. The curves were 
obtained by comparing consecutive amplitudes (i.e. 
maxima and minima) from a family* of solutions 
x(t,^t ) of the linear differential equation 

x + yu x + Xx = 0 (3a) 

with x(0) = a, x(0) = 0. 

It is important to determine over which 
range of yut the iteration process can be expected 
to converge to the stability boundary located at 
^B = 0 in the case under consideration. Inspec- 
tion of Figure 9 indicates that convergence to 
this point is limited to values of yu at which 
the a n -curves have a positive slope since the 
slope at the zero-crossing is positive. All 
a n -curves exhibit a maximum at some value 
Hmax n 0* Hence, it is seen that convergence 
to the stability boundary is assured only if the 
iteration process is started at a point yU Q 
located to the left of the respective maximum. 

(The iteration would proceed in the opposite 
direction if the initial point were located on 
the other side.) Starting the iteration with a 
case of positive damping, 0<jU Q <{U raax n , it is 
observed that, owing to the convex character of 
the a n -curves, the first iteration step always 
overshoots into the region of negative yu from 


* 


Solutions such as those shown in Figures 3> 4 
and 7 are representatives of the family. 
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where the convergence to ^ 3 = 0 is quite rapid. 

The sequence of iterated solutions depicted in 
Figure 7 confirms this observation. On the other 
hand, if the iteration is started in the unstable 
region (i w o< 0 ) convergence is always assured 
regardless of the position of the starting point. 

Inspection of the a n -curves reveals, further- 
more, that the location n of the maxima 

moves rapidly in the directioA of /J. = 0 for 
increasing n-values, i.e. the range of convergence 
in the stable region of (A is narrowed down if 
amplitudes other than the first few are picked for 
the prediction process. In the case of the first 
amplitude pair, n = 1 , the convergence range is 
exceptionally large extending to values in the 
neighborhood of critical damping, {a cr it = 2 /X. 

The above considerations are based on the 
case of a linear second order system but apply 
more generally to higher order systems exhibiting 
several distinct modes of oscillation. This is 
true because a particular mode of interest has the 
same structure as the single mode of oscillation 
characterized by the solutions shown in Figure 7. 

In the cases of practical interest a dominant mode 
of oscillation exists for which the stability 
boundary is being sought. In the presence of 
initial transients the amplitudes of the dominant 
mode will only he detected after some time inter- 
val. Hence, based on the information obtained 
from the a n -curves of Figure 9, there exists only 
a small range of positive damping over which con- 
vergence to the stability boundary is assured. In 
these cases an iterative approach from the 
unstable region is preferable. 

Figure 9 illustrates one other interesting 
fact which concerns the choice of single vs. mul- 
tiple intersections, i.e. the use of equation (18) 
vs. (I9), respectively. It is seen that, except 
in a close vicinity of the boundary point 0 

where all a n -curves have a common tangent, tne 
multiple intersection approach yields a cluster 
of increments While this disparity of 

predictions tends to degrade the rapidity of con- 
vergence of the iteration process, an averaging 
of predicted values is desirable whenever the 
read-out accuracy of successive maxima is limited, 

i.e. in the majority of analog computer opera- 
tions. 

It is expected that the essential features of 
this analysis carry over into applications in- 
volving non-linear oscillations, although this has 
not been studied in detail. In view of the dif- 
ferent laws governing the growth or decay of ampli- 
tudes in non-linear oscillations, the a n vs. 
fl curves may look substantially different from 
those shown in Figure 9* However, as long as a n 
is a continuous and well-behaved function of the 
parameter [A. near the stability boundary point, 
the iteration process in general is a convergent 
one and thus can be used to find the boundary. 
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Summary 

Looking into the future of data processing 
technology, it is possible to identify major trends 
in specific areas, viz. , components and circuits, 
data systems and equipments, programming and 
language problems, human factors and overall 
system design, and learning and cerebral systems. 
The real advances and the real broadening of the 
field are directly tied to our ability to make data 
processing machines more truly the direct exten- 
sions of the human intellectual process. 

Introductory 

Outside of proving the author wrong years 
hence, crystal gazing has two important functions: 
to assess and summarize present viewpoints, and 
to stimulate thinking in new directions. 

In the dynamic and developing field of data 
processing, looking at the future is particularly 
difficult and also particularly interesting. For an 
industry that started to find itself little more than 
a decade ago, economic growth and technical prog- 
ress have been spectacular. From the early relay 
machines performing their operations at humanly 
perceivable speed, through the long hot spell of 
electron -tube giants, to the ultra high-speed mag- 
netic-semiconductor computers the road has led 
through an astonishing number of difficult engi- 
neering achievements. Direct coding of simple 
numerical methods led to the complex structures 
of automatic assembly routines and common lan- 
guage programs, and thus the new science of pro- 
gramming came of age. The use of these data 
processors extended from scientific calculations -- 
for which they were originally intended- -into all 
facets of business, military, industry, and govern- 
ment. From isolated research projects there has 
grown a billion-dollar industry. 

The end is not in sight! There will be more, 
there will be bigger, there will be faster, and 
there will be more useful systems. But what will 
they be? In looking at the future, we are really 
examining the needs of the present because if we 
can assume that our endeavors will be in the di- 
rection of fulfilling these needs, then we can pro- 
ceed to look at the technical feasibility and the 
schedule of their fulfillment. 

The data processing field is a combination 
of many efforts, whose somewhat fragmented 
examination will not immediately give the total 
picture that we desire. However, let us try to 
synthesize the overall endeavor after separately 
discussing: 

♦ Components and circuits; 

♦Data systems and equipments; 

♦Programming and language problems; 

♦Human factors and overall system design; 


and 

♦Learning machines and cerebral systems. 

Circuits and Components 

There is no question that progress will con- 
tinue to be made toward smaller and faster com- 
ponents and circuits. For some time to come 
limitations in this respect will be the invention and 
utilization of materials, development of automatic 
and carefully controlled production techniques, 
and in general the ingenuity and resourcefulness 
of the designers. But eventually, and even as we 
go along, more fundamental limitations having to 
do with basic considerations in information theory 
will appear. Signal-to-noise ratio and heat dis- 
sipation will combine to make some dreams diffi- 
cult to realize, others impossible. It can be shown 
that to transmit a bit of information, some mini- 
mum energy, however small, is required. As 
more bits are to be moved in a given time, the 
energy per second or power will go up. To dis- 
sipate this energy might require means which can 
invalidate the advantage of speed and miniaturiza- 
tion. (Figure 1.) 

Recent inventions in molecular electronics, 
or molectronics, have aroused considerable inter- 
est. Without a doubt this method has tremendous 
promise. Practical and useful circuits will, in 
the near future, literally be "grown" for incor- 
poration into actual equipment. But we will not 
grow complete computers for many years, and 
probably we should never do so. 

Of more immediate interest are tunnel 
diodes, which serve as an example of the kind of 
developments to be expected in the semi-conductor 
field. The geometry and construction of the tun- 
nel diode suggest some short-term possibilities 
of multi-electrode tunnels for more complex 
circuit structures. 

Thin film memories certainly have a future. 
What is needed here are means of making measure- 
ments during the fabrication process so that it can 
be controlled; furthermore, efficient means should 
be found for making some of the selection cir- 
cuitry itself of the thin film type. Thin film mem- 
ories have been a long time coming and there is 
some question whether, by the time they become 
practical, other developed methods will not have 
filled the gap. 

The potential of cryogenic circuits would be 
vastly more powerful, were it not for the inherent 
volatility of information. To avoid the embarrass- 
ing forgetfulness a computer could experience due 
to a low supply of liquid helium, many problems 
Will first have to be solved. In addition to prac- 
tical and reliable cooling devices, the combined 
use of cryogenic and non-volatile circuitry might 
offer one potential measure of safety. For short- 
term use such as a missile flight volatility is not 
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a serious handicap. 

In the area of miniaturization some more 
immediate methods and results can be seen. The 
micro-modules that have been developed are cer- 
tainly effective but use components with leads, 
and therefore require laborious automatic or man- 
ual means of putting them together and soldering 
them. Recently some advances were made in the 
utilization of leadless components and of entirely 
new ways of plugging circuit modules into each 
other. These methods, which may be practical 
today, permit economical circuit construction. 
(Figures 2 and 3.) The components would be 
mounted within non-phenolic boards permitting the 
stacking of the circuit modules within any distance 
from each other. Components themselves are 
already so small (Figure 4) that the problems of 
miniaturization will have more to do with the inter- 
connections and the heat dissipation than the size 
of the components themselves. 

Data Systems and Equipments 

While the trend toward bigger, faster and 
more capable computers is likely to continue for 
some time, this trend will not be able to keep up 
with the ever increasing complexity of applications. 
Under such conditions, a problem will have to be 
segmented and performed on separate computers 
operating simultaneously. However, this approach 
might put serious limitations on the user, unless 
vastly improved communications and interconnec- 
tions are possible among the various units of a 
large system. This is then the new direction: 
systems consisting of any number of different units 
(large and small computers, buffers, all kinds of 
memories, input and output units, displays, ana- 
log and digital units, etc. ); high-speed automatic 
and completely flexible interconnections among the 
various units or modules of the system; and finally 
high-speed and automatic communication between 
systems. 

Presently, there are many systems which 
can be called modular in the sense that various 
peripheral units can be added to the system for 
improved capability. But usually these systems 
have most of their controls centralized in the sin- 
gle computer whose failure will stop the operation 
of the whole system. A system whose controls 
are distributed over a large number of units, 
whose only central element is an electronic switch- 
ing network, and whose modules can be automati- 
cally interconnected and reconnected in almost 
any manner has been developed and named "poly- 
morphic" or many- shaped. (Figure 5.) This 
concept of system design will spread not only to 
newly designed equipment but also to the consoli- 
dation of many presently available modules into 
polymorphic complexes. 

All digital computers so far marketed are 
one of two types: so-called algebraic or general- 
purpose (GP) and digital differential analyzer 
(DDA). It is somewhat disappointing to state that 
no fundamentally new and practical approach to 
digital computation looms on the horizon. As 
noted above, the innovations will arise in the man- 
ner in which these conventional computing and 
control units are used and interconnected. 


However, an extension of computer organization 
techniques along the lines of a variable instruction 
repertoire is a distinct possibility. Such a com- 
puter will be able to act as any of several differ- 
ent computers so that it can accept programs 
prepared for other computers and perform them 
at the same speed. This can be a significant ad- 
vantage in a polymorphic system consisting of 
different types of units and in general in breaking 
down the language barriers between computers. 

Strides will be made in the communication 
of data within and without a system. Higher speeds 
in electronic switching, error detection and cor- 
rection equipment, message traffic consoles, 
message generators are among the devices to be 
used with increased frequency. Communication 
between the human and the machine will receive 
vastly increased emphasis, as will be noted later. 

Programming and Common Languages 

Advances in programming are extremely 
important but seldom spectacular. This has been 
the case in the past, and this will be the case in 
the future. A massive and steady effort is essen- 
tial to enhance the usefulness and effectiveness 
of the ever-increasing number of data processing 
systems. 

There is a bewildering variety of computer 
order structures, and codes, with the result that 
no individual can thoroughly know more than a few. 
Consequently, there exists a tendency for pro- 
grammers to become specialists in particular 
computers, or series of computers. This trend 
will continue for some years, but it will be par- 
tially counteracted by the more general use of 
automatic techniques in programming. The sheer 
volume of programming to be done will make it 
mandatory to develop more advanced machine 
methods to let the computer help prepare its own 
programs. Years from now such self-program- 
ming techniques will incorporate automatic learn- 
ing methods which will be discussed in a later 
section. 

There will be, and there should be, a steady 
effort toward standardization of codes, instruc- 
tions, flow diagrams, program bookkeeping aids, 
etc. However, one must not expect rapid solu- 
tions; progress will be step-by-step, and problems 
will be met one-by-one. The design of common 
languages is an encouraging step in the direction 
of standardization, but this itself points up two 
difficulties: first, there are "languages" in plural, 
which indicates the limitations on their being 
"common"; second, the very use of common lan- 
guages points up the difficulty of standardizing 
problem languages and machine languages them- 
selves. 

It seems that general purpose (stored pro- 
gram) computers will often be applied in special- 
ized areas of application mainly because program- 
ming and other supporting activities will be aimed 
in such directions. The programming and appli- 
cations engineers will explore certain uses which 
then become the special purpose of the general 
purpose computer. This points up the fact that 
the importance of the programming support in 
relation to the hardware effort itself will increase. 



Programming methods will also have to keep 
pace with the new developments in computer or- 
ganization and system uses. In the flexible multi- 
computer systems mentioned in the preceding sec- 
tion one of the computers acts as master and 
determines which units of the polymorphic system 
will carry out what task and when. The design of 
the master control programs is a new and chal- 
lenging programming endeavor. 

Operational systems are often used by a 
large number of humans who are at best only 
moderately trained in computer and programming 
technology. Means should be provided for them to 
get at the programs readily and to perform modi- 
fications with relative ease. This problem in 
programming has hardly been touched, but its 
solution would result in efficient adaptive man- 
machine systems. 

Another need in intellectronic systems is a 
quick machine response to random and variable 
requests of the operators. A programming tech- 
nique which may offer a solution is the use of 
programs which are segmented into their compon- 
ent parts. These parts would be put together in 
accordance with the task to be performed and at 
the time that the task is to be performed. We can 
think of this method as on-line assembling and 
compiling, the need for which will increase with 
real-time applications of data processors. Where 
the inputs to the data processing systems are ran- 
dom requests coming from various sources and 
exhibit significant variations from time to time, 
the on-line composition of a program to handle 
each specific request may prove more efficient 
than the storage of individual programs to answer 
all possible requests. 

Human Factors and Overall System Design 

Data processing systems--particularly real- 
time operational systems --will be more truly the 
servants and the extensions of the human intellect 
if the interrelationships between man and machine 
are given appropriate consideration at all stages 
of system and equipment design. In intellectronic 
systems the human needs and the operator charac- 
teristics are basic factors. 

In the early stages of computer development 
the design of memory, arithmetic and control 
circuits and equipment was the first consideration, 
and the matter of input and output often became 
an afterthought. This trend has, of course, been 
reversed and ever increasing attention is being 
paid to the input and output problems and the con- 
venience of the operator. Yet, the operator must 
often be carefully trained in order to make it pos- 
sible for him to use present-day machines appro- 
priately and adequately. As more systems are 
used in real time, the careful training of the 
operator in order to permit him the use of the 
system is not always feasible. It is necessary, 
therefore, to make designs which allow the use of 
the system by a human only moderately trained. 

As man becomes a "black box" in the over- 
all system, human factors studies will become an 
integral part of system design. Two aspects of 
human factors can be distinguished: human engin- 
eering, which concerns itself with the relationship 
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of the human and specific piece of equipment such 
as a display device, or an input console; and man- 
machine system studies, which concern themselves 
with the overall interactions in a system complex 
consisting of humans and equipments. The latter 
aspect of human factors is gaining increased sig- 
nificance both in the military and in the industry. 

Think of using the library one day in some- 
what the following manner. You sit down at a 
desk-like piece of furniture (Figure 6) and you 
insert your library c-ard in an appropriate slot. 

Then you type out the subject matter for which you 
are searching. Note that you do not look up any 
indexes, glossaries, books on codes, synonyms, 
etc. Having typed in your request, an abstract 
flashes on the screen for you to examine. From 
the nature of this material you determine how well 
or poorly you might have stated your request. For 
example, if the abstract indicates that your request 
must have been too vague, you may type in addi- 
tional words or descriptions for increased clarity. 
As you keep on examining abstracts, you keep 
modifying your requests as necessary. Finally 
the retrieval reduces to, say, a dozen relevant 
abstracts. You go back and re-examine these 
abstracts, which appear by easy push-button 
control. This more thorough examination might 
reduce your total interest to three articles, or 
books, which you then indicate by a push-button as 
being your selection. The selected material slides 
down the chute at the left lower section of your 
desk. You remove your library card from the 
slot, pick up the books, walk out; you are already 
charged for them. 

The type of human factors design considera- 
tion that must go in the future library system will 
be part of the design of many intellectronic systema 
Such designs are not as far in the future as one 
might think; they are starting to be applied in 
military command systems and they will be shortly 
applied in management systems of business enter- 
prises. Teams of engineers, psychologists, phys- 
iologists, and other specialists will rapidly be- 
come part of any large data processing design and 
applications effort. 

Part of any systems effort is the design and 
development of individual items of equipment. In 
systems where the human is also an element, his 
design and development should also become part 
of the systems activity. Corresponding to the 
design in the case of a human is the selection of the 
human from the random population, while the 
development effort in this case consists of the 
training of the human. Both the selection and 
training of personnel must therefore become an 
integral part of the system design effort. It should 
never be the design engineer himself, or his col- 
leagues, who try out the system for its operational 
feasibility; rather the type and calibre of personnel 
who will actually use the system must also be the 
participant in the experiment. In this manner one 
has a greater assurance that the system will serve 
its operational purpose. 

Considering the training problem one can 
readily see how the difficulty of a training method 
can react on the design of the overall man-machine 
complex. There must be a balance of trade-offs 
between the complexity of the system and 
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procedures, and the difficulty of training the oper- 
ator to use the system. Once the system is de- 
signed and operational, an important task is the 
training of operators to use these systems. From 
the point of view of future development in this area 
we will see more automatic aids for training peo- 
ple. Training machines will be special-purpose 
devices having the appearance of the actual human 
factors consoles and displays to be used in the 
system, but having simple special-purpose storage 
and data processing capabilities which can simu- 
late the actual overall data processing system. 

The topic of training brings us naturally to 
the educational field. Our new capabilities in data 
processing and human engineering are bound to 
have a long-range effect on our educational methods 
and indeed, on our whole educational system. 
Already, television and closed circuit television 
have both demonstrated the tremendous potential 
of new techniques in education. Incidentally, the 
successful introduction of such methods depends 
to a large degree on human acceptance. The psy- 
chologists concern about how the student reacts 
to a two-dimensional teacher has just as much to 
do with the success of this new method as the 
solution of the technical problems of transmitting 
a picture. In an effort to find out more about the 
human element in an educational system, a large 
number of measurements have already been made 
and such new scientific disciplines as neurophys- 
iology, and psychophysiology have been identified. 
Emotional and physiological reactions are measured 
in order to optimize the student-machine -teacher 
systems and methods. The bookkeeping and stu- 
dent control problems themselves of an educational 
institution also deserve the fullest consideration 
of data processing technology. Training aids, 
study rooms, and decentralized user consoles for 
centralized libraries are some of the items sub- 
ject to automation in future intellectronics systems 
for education. 

Learning and Cerebral Systems 

One of the most fascinating and interesting 
research activities of our field will concern self- 
adaptive and statistically structured automata. 
Presently this whole problem area is being attack- 
ed on at least three different fronts: (1) cybernetics, 
which leans on neurology, physiology, and biology 
to study the detailed microscopic behavior of com- 
plex human and animal systems; (2) psychology, 
which is interested in human behavior in the mac- 
roscopic or overall black-box sense, and (3) data 
processing, which investigates programmed or 
structured automata from the point of view of 
their similarity with living cerebral systems. 

The interaction among these three fronts have 
been noteworthy, and will become much more 
substantial in the future. Progress in this field 
will be difficult to come by, but the results could 
be quite spectacular. It is a field where, hopefully, 
more of our talent will become active, and where 
more research dollars could be rightfully spent. 

Considerable work has been performed on 
structured machines in which a certain number of 
inputs are connected to a certain number of out- 
puts through redundant paths. The nature of these 


paths can be adjusted on the basis of performance 
and this provides the system with adaptive or 
learning capability. Often these systems are 
simulated on a programmed computer. The struc- 
tures so far examined have been very simple 
compared to the complexity required to simulate 
a cerebral systdm. But even so, the preliminary 
results are interesting and promising. 

As the number of inputs and outputs in a 
structured machine increases, the complexity of 
the device assumes astronomical proportions. A 
truly meaningful learning device would have bil- 
lions of bits of information stored, and these cells 
would have to be interconnected by channels having 
complex adaptive mechanisms. To implement 
such a device, component and circuit development 
would have to advance by orders of magnitude. 

Yet, we do not have to proceed all the way. It is 
feasible to construct within years structured learn- 
ing machines of complexities which, although 
inadequate for practical use, will be meaningful 
for significant contributions to this area of research. 
It is time to apply appropriate circuit talent di- 
rectly to the problem of implementing, at least in 
a semi-practical fashion, the next degree of com- 
plexity of such machines. The kind of circuits 
that will be required will probably not be straight 
binary or straight digital, but will exhibit mix- 
tures of continuous and discontinuous behaviors. 

It is a new horizon for the circuit specialist. 

Closely related to the field of learning ma- 
chines is pattern perception. We are starting to 
learn quite a bit about what parameters and 
characteristics are important in describing a 
pattern. Thus, we know in some cases that a 
point-by-point description of a shape or event is 
unnecessary, and we further learn about what 
parameters are necessary to describe the event. 
What is less known is how to measure these param- 
eters directly. At the present time we are still 
forced to measure a shape or event point by point, 
and to use automated inductive reasoning in deriv- 
ing or synthesizing the desired parameters. Again, 
it will be a mixture of digital and analog techniques 
as well as a mixture of definitive and statistical 
considerations that can provide us with some 
answers. 

Four stages of development can be distin- 
guished in the pattern automation field: pattern 
following, pattern generating, pattern matching, 
and pattern recognizing. Pattern following is a 
straight analog servo problem extensively ex- 
ploited in the machine tool control field over the 
past years. Pattern creation, or generation, is 
again used in the machine tool field. For example, 
a circular pattern is generated by defining such 
parameters as the center of the circle, its radius, 
and the beginning and end of the arc. Pattern 
matching serves to automatically determine the 
similarity of two patterns. For instance, using 
optical means, it is possible to recognize whether 
a specific word or set of specific words is, or is 
not, contained on a printed page and where on that 
printed page it is contained. Correlation tech- 
niques have been developed to get a certain con- 
sistency and reliability in recognizing predefined 
patterns, which may be keywords or signatures. 

The importance of this in library search and 
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retrieval, as well as in the field of business and 
banking, is readily visualized. Pattern recogni- 
tion is the last and ultimate step. It is quite con- 
ceivable that pattern matching techniques in com- 
bination with feed-back or trial-and-error methods, 
may become the mechanism by which patterns can 
be recognized much before straightforward pattern 
recognition methods are fully developed. At any 
rate, this area poses a real challenge and the 
promise of leading to most significant future 
results. 

Learning and adaptive systems are rapidly 
becoming not only important catchwords to attract 
contractual support, but also practical devices 
that might improve the performance of control and 
data processing systems. For instance, in the 
field of machine translation of languages, the 
translations are now scrutinized by human linguists 
who find the shortcomings and change the program 
in order to improve the automatic translation. 

The next step is the design of programs that can 
improve their translating capability upon the mere 
indication by the human that a shortcoming of a 
specific sort has been observed. The same com- 
ments are applicable to indexing and abstracting 
systems which would constantly monitor the 
satisfaction or dissatisfaction of the human con- 
sumer in order to improve their methods of 
indexing and retrieval. These methods could also 
be applied to the development of common languages. 

The tasks ahead of us in this field are tre- 
mendous and require the support of substantial 
research efforts over periods of time, not to be 
considered short. 

Summary 

When we call ourselves a computer society, 
most of us realize that the word "computer" is 
used to cover a multitude of related activities. 

In trying to describe the broadening horizon of our 
activities, we started to use expressions such as 
"data processing", "cybernetics", "information 
systems", "automation", and more recently, 
"intellectronics". These words are not just words: 
they are coded symbols of our professional aims. 

From our brief review of things to come, we 
can conclude without a doubt that the advances in 
our technology will be startling, and indeed they 
seem unlimited. Computers will be small enough 
to become vastly more useful in many new appli- 
cations, and powerful enough to perform almost 
any conceivable task. Programs will become more 
generally useful to facilitate the solution of a 
greater variety of problems. But the real advances 
and the real broadening of our field are directly 
tied up with our ability to make these machines 
more truly the direct extension of the human 
intellectual process. 

Computers are sometimes named "bright 
but ineffective children". When we consider the 
time required for a new data processing installa- 
tion to become fully effective, or the time to pre- 
pare a new problem for the computer, or the 
difficulty of translating a program prepared for 
one computer for use on another, or the length of 
training required to make the operator properly 
conversant with the machine, or the procedure 


required to make even a simple change in a pro- 
gram while it is in the computer, or the unnatural 
manner in which the computer would go about 
recognizing a pattern, or the clumsy indirect 
means of communication between the human and 
the data processor, then we must admit that 
there is a lot to be done to gain effectiveness. 

These problems are our new challenges, and 
their solution will constitute the milestones of 
this developing technology. 

The effort that will be put into new and 
miniaturized equipment, greatly advanced pro- 
gramming techniques, vastly improved communi- 
cations between machine and machine, man and 
machine, machine and man, and perhaps man and 
man, statistically structured information systems, 
will be justified--in the final analysis--by the 
economies and the effectiveness of future intel- 
lectronic systems. The increased scientific and 
engineering support needed by this field will pre- 
sumably come from the vastly increased utiliza- 
tion of automated systems in the military and in 
the industry, or indeed, by the general public. 

The applications willcover a broad spectrum. 

Data processing and computing aids will permeate 
all facets of our life. But progress will be marred 
by an effect which industry does not await with 
pleasant expectations. The ratio of research- 
scientific -engineering dollars to be expended for 
each production dollar, and perhaps even for each 
dollar saved in operation, will continue t© increase. 
This is the price of automated intellect, and this 
is the price which must stay commensurate with 
the conveniences gained and with the growth of 
our economy. 

The wider application of automated aids to 
our intellectual processes might have some 
interesting effects on our approach to problem 
solution. The tendency of not developing a broad 
base of fundamental scientific know-how but rather 
depending on the ready means of having reference 
to relevant material and eventually having at our 
disposal- -pardon the expression- -"thinking aids", 
is a truly significant trend of our times. Today we 
might say "I will read it when I need it, " and 
tomorrow we might say "why read it when I can 
get the answer to my question directly?" Such 
tendency is very much in line with the growing com- 
plexity of our everyday life. 

It is hard to see where this intellectual revo- 
lution of the Twentieth Century will lead. For it 
is important to maintain the proper perspective on 
what it is we are up to. Let us not propose to 
install a super computer in the White House to run 
the country by direct control. Nor let us squander 
our energies and resources in providing the house- 
wife with a computer to let in the cat in the morn- 
ing. Between profundity and trivia, let us remembei 
the human element and steer a prudent course to 
the benefit of our society. Only the practical ful- 
fillment of sound human needs can guarantee the 
healthy growth of our technology. 
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THE OUTLOOK FOR MACHINE TRANSLATION 
Franz L. Alt 

National Bureau of Standards 
Washington, D. C. 


Introduction 


The idea of using electronic digital computers 
for language translation seems to have arisen about 
1946; it was first brought to widespread attention 
in a memorandum by Warren Weaver 1 in 1949, Today 
there are about a dozen projects in the United 
States devoted to machine translation, mostly trans- 
lation from Russian to English. Many additional 
projects are located in the Soviet Union, a few in 
other countries. Many of these projects have been 
in operation for a number of years. Yet today 
there is nobody in a position to feed some foreign 
text into a computing machine and produce automat- 
ically an acceptable translation. This is not to 
say that there have been no accomplishments. Some 
achievements may be noted, in some other areas 
there has been considerable progress; and it seems 
plausible that in a few years completely automatic 
translation of fair quality will be possible. 

Most projects in the United States have been 
devoted to translation from Russian to English, 
although French and German have also been consid- 
ered as source languages. In some projects, 
studies of the English language have been conducted 
in the hope of benefiting machine translation. The 
effort in the Soviet Union, from the scant informa- 
tion available, seems to be of a magnitude 
comparable to that in the United States, but spread 
over more language pairs. Although the two major 
projects in Great Britain have considered transla- 
tion from French, German and Russian into English 
they have concentrated mainly on general questions. 
Not only do different projects work on different 
language pairs, but they frequently attack differ- 
ent portions of the translation problem. Many 
limit themselves to source tests in specific fields, 
such as chemistry or mathematics; some concentrate 
on the theories of linguistic structures; some on 
the compilation of a dictionary; some on methods of 
transcription of the original text onto punched 
cards or other machine media; some on so-called 
automatic programming for language translation, 
which in this case means the creation of general- 
purpose machine programs which can perform a 
variety of linguistic operations. When they deal 
with the central problems of machine rules for 
analyzing the source words and synthesizing target 
sentences , there are many differences in the methods 
which are used. In order to appreciate these dif- 
ferences, we shall first have to survey the main 
difficulties in the way of translation and the 
methods that have been proposed to overcome them. 

Methods of Machine Translation 

When the idea of automatic translation 
emerged, at first only word-for-word translation 
was considered. The inadequacies of this plan were 
obvious. They manifested themselves principally in 
the facts that some source words had more than one 


target word associated with them, and that the 
word order in the target language often has to be 
different from that in the source language. 

Speaking of multiple target words and of word 
order, however, is not the most useful approach to 
classification of problems. The change in word 
order can be understood by analyzing the grammar 
of a sentence. Multiple target words can be either 
different grammatical forms belonging to the same 
stem, or words having entirely different meanings. 
Thus, in the main, the problems encountered in 
translation are classified into syntactic and 
semantic problems. 

The first attack on semantic problems was 
outlined in the memorandum by Warren Weaver 1 in 
1949. Early ideas ran as follows: A word like 
"nucleus" might have one meaning in a context of 
physics, another meaning in a context of biology, 
etc. There are other words which occur only in 
contexts of physics, still others which occur in 
biology, and so on. In the dictionary, each word 
would be coded according to the fields in which it 
is used. If a word of multiple meaning is encoun- 
tered, a number of neighboring words both before 
and after the questionable one would be searched 
in order to get a "majority opinion" concerning 
the field with which the present text is concerned. 

This idea of word classification by context 
does not dispose of the problem entirely. The 
choice among multiple meanings may be determined 
not by context but by certain successions of words. 
For instance, the same Russian word may have to be 
translated into English as "prove" in the sentence 
"Prove the theorem of Pythagoras" or as "show" in 
the sentence "Show me how to construct a regular 
pentagon". Attempts are being made to establish 
large-scale statistical information about pairs of 
words occurring together. Another approach has 
been to attempt a more refined classification of 
word meanings, resulting in something like an 
oversized Thesaurus. Still other investigators 
maintain that the magnitude of the semantic 
problem can be greatly reduced by attempting when- 
ever possible to find neutral translations which 
will cover as many meanings of the source word as 
possible . 

The three approaches to the semantic problem 
which we have just outlined might be named the 
statistical , systematic and empirical approach. 

The same three methods of approach can be distin- 
guished in dealing with the syntactic problem. 

Here, the statistical approach consists in 
searching through large amounts of source texts 
and enumerating the frequency of certain word 
sequences. For instance, how often does an 
adjective precede a noun, how often does it follow 
it? The systematic approach attempts to set up a 
system of rules — in other words, a machine program 
— which analyzes the syntactic structure of each 
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source sentence. That is to say, it identifies 
subject, predicate, direct object, etc. of each 
sentence or clause. This frequently has to be 
preceded by a grammatical analysis of each word, 
just as conventional grammar is divided into 
morphology and syntax, the former dealing with the 
inflectional forms of each word, the latter with 
the function of each word in the sentence. Finally, 
the empirical approach starts by selecting a few 
very simple rules for translation, tries them on a 
body of text and notices where they fail, corrects 
the rules or introduces new ones to cope with the 
observed failures, tries the revised rules on a 
larger body of text, and so forth. 

From a slightly different viewpoint we may 
distinguish between the use of conventional grammar 
and the design of new systems of grammar (or 
"linguistic structures", as they are called), which 
are intended to be better suited to mechanical 
analysis than is conventional grammar. 

There are other differences among the various 
groups working on machine translation. For 
instance, in designing the dictionary or glossary, 
some propose to list in the glossary every inflec- 
tional form of every source word, while others 
propose to list only the stem, or equivalently 
some canonical form such as the infinitive of a 
verb and the nominative singular of a noun. 

Russian nouns have a dozen inflectional forms, 
adjectives and verbs many more. Thus the size of 
the required glossary is greatly affected by this 
decision . 

Other differences are found in limiting the 
scope of a machine translation project. Some 
groups are satisfied to translate into a kind of 
pidgin English. Some are resigned to leaving 
certain semantic ambiguities unresolved and print- 
ing out multiple meanings. Some are willing to 
admit failure in a small percentage of all cases. 
Some will even admit undetected errors in the trans- 
lation (The latter is a point of view which I 
consider dangerous). Some propose to use a man- 
machine partnership rather than letting the machine 
do the entire job. In these cases the machine 
prepares certain aids to translation — at best a 
kind of preliminary draft, and these are used by a 
"post editor" in producing a polished translation. 
Pre-editors are less frequently contemplated, but 
a certain amount of pre-editing may be combined 
with manual key punching of the text. 

It is my feeling that, in dealing with the 
syntactic problem, the empirical approach is most 
likely to give an early appearance of success, by 
leading in a short period of time to a system for 
producing less accurate but serviceable transla- 
tions. It will prove quite difficult to improve the 
quality of translations later on. The systematic 
approach may take a little longer in achieving its 
first concrete results, but these will be of higher 
quality and will lend themselves readily to further 
improvement by adding more and more refined rules 
of grammatical analysis. The statistical approach 
to syntax seems to me to be far from successful. 

On the prospect of success in the semantic problem 
I have no present opinion at all. 


The problems associated with mechanical trans- 
lation may perhaps be better appreciated by looking 
in detail at one example. For convenience I 
propose to use the work now in progress at my own 
organization. 

The project at NBS, under the direction of 
Mrs. Ida Rhodes, concentrates on problems of syntax 
and dictionary organization. In its approach to 
syntax it stays close to conventional grammar. We 
may visualize the proposed machine program as con- 
sisting of two parts, the first concerned with 
glossary look-up and morphological analysis, the 
second with syntactic analysis. We look upon 
constructing such a machine code as similar to 
setting up a mathematical theory of language. Some 
idiosyncrasies of the project arise from the fact 
that its staff consists mostly of mathematicians, 
and coincidentally we are using mathematical texts 
for our experiments. 

The dictionary look-up proceeds on the assump- 
tion that, in any foreseeable machine scheme, 
memory capacity will be a bottleneck. At this 
point of our exposition this is a mere assumption, 
but we shall see soon that it is well founded. It 
will be impossible to store the entire dictionary 
in the internal memory of the machine. It will be 
stored on an external medium, for instance on 
magnetic tape or on a magnetic disc memory. Since 
reading from an external medium into the computer 
is relatively slow, every effort is made to store 
the dictionary as compactly as possible. This 
desire explains some of the unusual features of 
the machine program. 

The program starts by reading a few hundred 
words of source text into the machine. Each word 
is decomposed into its inflectional ending, 
prefixes, suffixes, and the root. Endings, 
prefixes, and suffixes are identified by compari- 
son with stored lists, and are replaced by numer- 
ical codes. The roots are put in alphabetical 
order and are then looked up in the dictionary by 
a single pass through the external medium stored in 
the dictionary. There, we find the source words 
stored in alphabetical order of the roots, listing 
under each root those combinations of prefixes and 
suffixes which occur in the source language in 
combination with this root, each followed by 
grammatical information about the source word and 
by one or more English equivalents. This informa- 
tion is read into the internal memory. 

The foregoing exposition is oversimplified in 
several respects. Sometimes the machine obtains, 
in place of a prefix, a group of letters which only 
looks like a prefix but which grammatically is not 
one. Such a group is called a pseudo-prefix. 
Similarly there are pseudo-suffixes. What is left 
after splitting them off is called the pseudo-root. 
For instance, if the method were applied to Eng- 
lish, the word "conifer" would be split into the 
pseudo-prefix "con", the pseudo-suffix "er" and the 
pseudo-root "if". Another point that has been over- 
simplified is the arrangement of the dictionary. 
Although we called it "in alphabetical order of the 
roots" the program actually uses a more elaborate 
and more economical scheme. 
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The information obtained from the dictionary, 
together with the previously identified ending, 
results in a morphological description of the word 
in question, which is stored for later use. In 
the second part our code works on one sentence at 
a time and establishes the elements of each sen- 
tence, such as subject, predicate, direct object, 
etc. In this process we use a device which, to the 
best of my knowledge, is unique. As each word is 
identified, it is used to "predict" other sentence 
elements. For example, a transitive verb predicts 
a complement in the accusative case, etc. All 
these predictions are stored and each new word is 
compared with them. If no match is found, this 
very fact is stored in the expectation that it will 
be resolved by a subsequent word of the sentence. 
Thus, throughout this part of the program we have 
two pools of stored information which we call fore- 
sight and hindsight, and which assist in determin- 
ing the syntactical function of each word in the 
sentence. This syntactical information, in turn, 
indicates how our translation should differ from 
word-for-word translation. For example, it indi- 
cates changes in word order, the insertion of Eng- 
lish prepositions like of or by, etc. 

This account of the method being developed at 
NBS is necessarily brief and omits many essential 
features, for instance, how we deal with clauses 
and phrases within a sentence, or with word forms 
which are morphologically ambiguous. What I have 
said will, however, suffice for our present pur- 
poses. To date the dictionary look-up and 
morphological analysis has been coded, and the 
syntactic analysis is in process of being 
programmed. 

Survey of Projects 

The most important ones among the mechanical 
translation projects in Western countries will be 
enumerated here, in geographical order from West 
to East, and each project briefly characterized. 

O 

For a recent complete survey, see e.g. Bar-Hillel. 

1. University of Washington, one of the 
oldest projects, concerned with translation from 
German and Russian to English. Extensive diction- 
ary of inflected forms. Word-for-word translation, 
study of selected grammatical and semantic problems. 

2. University of California, Berkeley, one of 
the youngest projects. Russian to English. 

Emphasis on economy in the use of the dictionary. 

3. Rand Corporation, Russian to English, 
empirical and statistical approach, some work on 
semantic problems . Large corpus of transcribed 
Russian material in Physics and Mathematics. 

4. Ramo-Wooldridge , Russian to English, 
empirical approach, emphasis on programming methods 
and on display of results in a form which facili- 
tates continuing revisions. In its approach to 
grammar this project is close to one of the groups 
formerly at Georgetown University. 

5. University of Texas, a newcomer. German 
to English, concentration on grammatical problems. 

6. Wayne State University, a new group, 

Russian to English, statistical approach. 

Cooperates with Ramo-Wooldridge and with a group 
formerly at Georgetown University. 

7. Georgetown University, one of the oldest 


groups, at one time comprised four separate 
projects. One of these is disbanded, its ideas 
being continued at Ramo-Wooldridge and Wayne. 
Another, Russian to English on an empirical basis, 
recently moved from Georgetown to a private cor- 
poration, CEIR. 

A third project at Georgetown, called "Gen- 
eral Analysis Technique", Russian to English, has 
made considerable progress with syntactic analysis, 
which is being developed in part empirically and 
in part systematically, staying fairly close to 
conventional grammar. A good-sized dictionary has 
been assembled and a large corpus of text examined. 
Heavy reliance is placed on post-editing. 

The fourth of the Georgetown projects works on 
French to English, is strictly empirical, lays 
stress on advanced programming techniques. 

There are also small-scale efforts devoted to 
Chinese and Arabic. 

Most projects at Georgetown are concerned 
with source texts in the field of chemistry. 

8. National Bureau of Standards, a relatively 
recent project, Russian to English, systematic ; 
approach to grammatical (morphological and syntac- 
tic) analysis, staying close to conventional 
grammar. Emphasis on economical use of dictionary. 
Uses source texts in mathematics. 

9. University of Pennsylvania, systematic 
approach to grammatical structure of languages, 
esp. English. 

10. IBM Corporation, concerned mostly with 
hardware, with some supplementary systems studies. 

11. Massachusetts Institute of Technology, 
one of the oldest projects, has pioneered in 
developing new theories of the grammatical struc- 
ture of languages. Also works on syntactic 
problems of German- to-English translation and on 
general-purpose programming systems for machine 
translation in general . 

12. Harvard University, Russian to English, 
has compiled a large dictionary and_ worked system- 
atically on methods and machine codes for diction- 
ary compilation and updating, and on word-for-word 
translation. More recently concerned with 
grammatical analysis on lines similar to National 
Bureau of Standards, and with general theory of 
language structures. A related project is in 
operation at the Arthur D. Little Company. 

13. Birkbeck College, University of London, 
apparently the oldest group in the field. German, 
French and some Russian to English. Morphological 
analysis of source words, some syntax, largely 
empirical . 

14. Cambridge University, England. General 
linguistic theory, semantic problems. 

15. University of Milan, Italy. A highly 
theoretical, long-range approach. 

There are a few other projects, either too 
small or too new to be characterized separately. 
They include one at the National Physical Labora- 
tory in England and one in France. 

Economic Considerations 

We now turn to the discussion of the cost of 
using electronic computers for language transla- 
tion. Cost depends on the system used, i.e. on 
equipment and methods. Although we shall mostly 
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deal with cost in dollars, there are elements of 
cost not readily expressible in terms of money, 
such as the damage caused by time delays or by 
erroneous translation. 

There are so many uncertain and unpredictable 
factors in the situation that we can discuss only 
orders of magnitude. In some cases we can estimate 
within a factor of 2 or better; in other cases we 
are lucky to come to within a factor of 10 of true 
cost. Sometimes, rather than assign a cost to a 
process, we shall state the conditions under which 
this process becomes competitive with others. 

Human Translation 

The cost of translating "by hand”, i.e. by 
human translators, seems to range from 0.25 to 4 
cents or more per word. If quoted per page, one 
hears figures ranging from #12 to #25 per page. 

The cost depends on the qualifications of the trans- 
lator, on the technical difficulty of the text, and 
on the degree of perfection, smoothness of style 
and appearance desired. It is cheaper in foreign 
countries than in the United States. As an indica- 
tion of order of magnitude we may use the figure of 
1 cent per word. In this connection "word" means 
the usual average of five characters followed by 
one space. To avoid confusion with the "machine 
word", the unit of memory space in computers, some 
authors use the word "grubit" for the latter. It 
happens that on many machines a word corresponds to 
about six characters. 

Apart from the monetary cost, human translation 
suffers from its slowness and from the scarcity of 
experienced translators, especially for technical 
subjects. The quality of translation is variable, 
and from time to time gross errors appear which 
could be avoided by translators with greater tech- 
nical competence or by the use of very detailed 
technical dictionaries. By and large, however, the 
quality of human translation is far superior to 
anything in sight of mechanical translation. 

The cost of mechanical translation may be 
divided into the two elements of initial cost and 
current cost. 

Initial Cost 

Under this heading we shall discuss the cost 
of developing programs and codes for automatic 
translation. 

As always in cost accounting questions, there 
is some ambiguity about where some specific items 
should be charged. In the present problem, it 
could be argued that the initial cost of developing 
the hardware should be treated as an element of the 
initial cost of translation. It is, however, my 
contention that mechanical translation will, if at 
all, be performed by general-purpose equipment 
which would be developed in any case, regardless of 
its application to automatic translation. 

Current spending in the United States for the 
development of machine programs for translation 
probably amounts to between 2 and 3 million dollars 
per year. The effort maintained in the U.S.S.R. is 
of the same magnitude, though perhaps at lower cost. 
Including projects in other countries, an amount 
between 5 and 10 million dollars per year is being 


invested in this work. One may speculate that at 
the present rate it will take at least five years 
before the system is really perfected, notwith- 
standing certain glowing newspaper accounts about 
systems supposedly now in operation. It is true 
that certain simple-minded things can be done right 
now, certainly the word-for-word translation and 
probably something a little better. It is also 
very probable that some concrete and respectable 
progress will be visible two or three years from 
now but I should estimate at least five years 
before a really satisfactory system is reached. 

Even this may be an underestimate. Before we are 
through, the world-wide investment in these 
programs may be #50 million. 

Now there are good reasons for saying that 
this investment need not be amortized. Present 
research in automatic translation is of great 
scientific interest, will result in deep insights 
into the working of the human mind and in great 
improvements in our use of computers in general, 
and is worth being carried on for its own sake. 

But let us take the narrow viewpoint and insist 
on amortizing it. The question is, over how long 
a period of time or what volume of work? Let us 
assume that, after a number of systems have been 
perfected and the best of them selected, we may 
wish to translate, over a number of years, a total 
of 40,000 volumes averaging 500,000 words each, or 
a total of 20 billion words, so that the amortiza- 
tion cost per word is 1/4 cent. (We may disregard 
interest cost for our purpose.) Forty thousand 
volumes is the size of a medium-sized technical 
library; in a large technical library probably the 
new accessions alone total 10,000 volumes in one 
or two years. Remember, also, that translation 
will be made from and into several languages. 

Thus, even if amortization is extended over a small 
number of years, it will cost only a small fraction 
of a cent per word. In other words, work on auto- 
matic translation systems is an excellent invest- 
ment . 

Equipment Costs 

The current cost of machine translation will 
depend on the hourly cost of the machines used and 
the amount of machine time for translation. 

These factors, of course, vary greatly from 
one machine to another. As a starting point, I 
propose to discuss the most advanced machines on 
which complete performance and cost specifications 
are available, machines like the Stretch computer 
of IBM or the LARC of Sperry-Rand. These computers 
come with a variety of optional equipment, and 
depending on this we may estimate their hourly cost 
at between #1,000 and #2,500. (This is intended to 
be first shift rental, without overhead or other 
charges.) A little later we shall be more spec- 
ific about the kinds of optional components we may 
wish to include in a system designed optimally for 
language translation. 

These computers are scheduled to perform 
operations like addition, subtraction, transfers, 
logical operations, etc. at the rate of about one 
microsecond per operation. Multiplications and 
divisions take longer, but these operations are 
hardly used in a machine program for language 



translation . 

Apart from the computing machine itself, we 
have to consider terminal equipment. There seems 
to be no problem at the output end. Conventional 
tape-controlled printers can print a thousand words 
in something less than a minute, at a cost of a 
fraction of one dollar. More advanced equipment 
will undoubtedly be even more economical. 

The input operations are indeed critical for 
the success of machine translation. If a Russian 
text had to be punched into cards, say, by hand, 
the cost of this operation and of checked and re- 
vised punching to the required degree of accuracy 
would be almost comparable to the entire cost of 
human translation. Fortunately there seems to be 
reason to expect that reading machines, capable of 
scanning printed pages of Russian texts and record- 
ing them on some mechanical medium suitable for 
machine input, will be available in the next few 
years. Some rudimentary machines are already in 
existence. ^ We have no specifications, nor reli- 
able cost figures, for an entirely satisfactory 
system, but it seems likely that the cost of 
operating this equipment will be considerably less 
than the cost of key punching. 

Machine Time Requirements 

I do not yet know how many instructions will 
be required for the entire translation program. It 
will certainly be several thousand, it may be sev- 
eral tens of thousands. Not all of these will be 
executed for every source word which is to be trans- 
lated. Most of them will come into play only 
occasionally, while some will be executed repeated- 
ly for every word. Let us estimate that 10,000 
instructions have to be executed for each word to 
be translated. On next-generation computers the 
execution time of most instructions is about one 
microsecond, so roughly speaking we can translate 
at the rate of 100 words per second. This 
translation process has to be overlapped by reading 
from the dictionary. It has been estimated that 
the dictionary will have to contain 50,000 Russian 
stems, and that even in a highly condensed version 
of storing in the dictionary such as that of NBS , 
about 20 machine words will b^ required for each 
Russian stem. This is a total of one million 
machine words, and is probably also on the high 
side. There is no difficulty about storing one 
million machine words on a magnetic disc memory. 

To read through this dictionary in alphabetical 
order requires about one second, and during that 
second the machine can simultaneously translate 
about 100 words. Thus, the optimal procedure will 
appear to be to read the source text into the 
machine in batches of about 100 words, alphabetize 
them (or rather their roots), read through the 
dictionary and extract the information pertaining 
to these 100 words, and . then proceed to translate 
while the next hundred words are being read in and 
looked up. Internal memory required is only 2,000 
words for the dictionary information pertaining to 
100 source words, storage space for all instruc- 
tions required by the program, and some temporary 
working storage. A total of 32,000 words of inter- 
nal memory is probably adequate. The hourly cost 
of a machine with 32,000 words internal memory and 
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a magnetic disc memory is on the order of #1,000. 

The machine translates at the rate of 100 words 
per second, or at a cost of 1/3 of a cent per word. 
This, as we said before, compares with one cent 
per word by human translator. Since all our esti- 
mates are quite crude, all we can say is that the 
cost of machine translation by this scheme is 
probably not higher than by human translator. 

To see how these economic factors might 
depend on our choice of equipment, let us suppose 
that we wish to use the same translation system 
on one of today's machines, like an IBM 704 or a 
Univac. Here we have no disc memory, so that the 
dictionary has to be stored on magnetic tape. 

This will take something like half an hour for 
reading into the machine. The internal memory of 
these machines is at most 32,000 words, of which 
we can assign at most 20,000 for copying the 
pertinent portions of the dictionary, reserving 
the remainder of the memory for instructions and 
working space. Twenty thousand machine words of 
memory will hold the dictionary information corre- 
sponding to 1,000 source words. Thus, the proce- 
dure would be to read the text in batches of 1,000 
words, alphabetize them, copy from the dictionary 
and translate. The translation time for 1,000 
words, at 10,000 instructions each, is about five 
minutes. Thus the computer is poorly used, most 
of its time is spent in reading through the dic- 
tionary over and over again, and the total time 
required to translate 1,000 words is over half an 
hour, giving a cost of about 15 cents a word. 

Here again, since all our estimates were quite 
crude, we can only say that machine translation by 
means of today's computers is probably more expen- 
sive than by human translator, perhaps by an order 
of magnitude. 

There are other factors to be considered. 

Many instructions are used quite infrequently. 

For example, the Russian word for "and" requires a 
whole subroutine all by itself. Probably many 
other words require special treatment which has to 
be incorporated into special subroutines, and 
these might in the end account for a large portion 
of the instructions used. There is no reason why 
such subroutines could not be stored in the dic- 
tionary with the word to which they pertain. Thus, 
we can to some extent trade dictionary space for 
internal memory capacity. The same can be done in 
reverse order by selecting a fairly large number 
of frequently occurring words and storing all 
dictionary information about them in the internal 
memory, reserving the external dictionary for the 
less frequently used words. 

It may turn out that our extreme insistence 
on conserving space in the dictionary has been 
unnecessary. Perhaps we can loosen up a bit, 
waste a little space in the dictionary and store 
information in less concentrated form. This may 
result in a dictionary several times larger, but 
on future machines all reading from the external 
dictionary will be overlapped by computing, and 
the computing time would actually be reduced by 
this more generous allotment of external storage 
space . 

Our present crude estimates of the require- 
ments for different translation systems do not 
enable us to choose definitely a best among them. 
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Several of them appear to yield cost figures of the 
same order of magnitude. Whichever will be the 
final choice, it does seem likely that some system 
will be found whereby machine translation will be 
considerably more economical than human translation. 
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Fort Sill, Oklahoma 

A FORWARD OBSERVER FIRES ON A TARGET 


A Field Artillery forward observer moving 
with an infantry company in the attack throws him- 
self down behind a small outcropping of rock. He 
glances at the plastic coated map in his right 
hand; he sets the dials on a small device he holds 
in his left hand* He presses a button. In less 
than 3 minutes lethal volleys from two batteries 
of Field Artillery fall on the area occupied by 
the enemy recoilless rifles and automatic weapons 
which have been holding up our advance. The en- 
emy weapons are silenced. The observer again 
turns dials on his hand message generator and 
presses the "SEND" button. He has reported the 
results of the fire. 

Three miles to the rear, the operations of- 
ficer in the Artillery battalion fire direction 
center turns to his battalion commander and says, 
"That completes the fifteenth fire mission we 
have fired today without adjustment, but with ex- 
cellent effect. Our observers are reading the 
maps well; and of course, our computers are com- 
puting the firing data with deadly precision. 

The results of that mission will by now be stored 
on the magnetic tapes at Division Artillery Head- 
quarters and soon will be input to the periodic 
intelligence program •" 

Computers Moving Into The Field Artillery 

Yes. Computers are moving into the Field 
Artillery; and we are studying the implications 
and applications of the computers to Field Artil- 
lery. It is my purpose to tell you something of 
how we expect to fit computers into tactical field 
operational doctrine and training in the years 
immediately ahead. 1 am in the field artillery 
and in a strict sense I am qualified only to speak 
of the applications for computers in that combat 
arm of the Airay. However, the Artillery* s compu- 
ter effort is a part of the total effort and I 
believe you would like to have a brief glance and 
the big picture first, before we narrow down to a 
single combat arm. 

The Army* s ADP System 

The Army has undertaken an ambitious Automa- 
tic Data Processing System development program in 
which we .foresee having the field array served by a 
number of computers working in a plan combining 
independent and integrated action. The total 
number of tasks foreseen for our ADP system num- 
bers in the area of one hundred and ranges from 
the complex mathematics of our required ballistic 
solutions, computed entirely in the forward areas, 


to the data processing of logistical requirements 
dependent on a network of computers and entry de- 
vices. 

I am sure that you can all form an idea of 
the tremendous potential savings in time and money 
available in a computer system encompassing an en- 
tire theater of operations, and even the contin- 
ental United States, that is, the "Zone of the 
Interior". In past wars we have had cumbersome 
depot systems stretching from the home factory to 
the forward combat zone. From the large depots at 
the port of debarkation a series of successively 
smaller depots stretch forward. Maintaining sup- 
plies to fill this long pipeline was a formidable 
task. Knowing what you had available was an equal- 
ly formidable task. Recall that where industry is 
faced with this same problem, industry is not fac- 
ed with constant movement of depots, movement of 
the forward end of the pipeline, rotation of per- 
sonnel, and accident as grave as bombing and shel- 
ling; nor is industry faced with a communication 
system that is in a state of great flux. In sum, 
then our depot system and supply problem in a 
theater of operations has always been inevitably 
costly and inefficient. Enter data processing. 
With rapid accounting, with speedy and accurate 
reply to queries from anywhere in the system, and 
perhaps best of all, with powerful and valid ex- 
trapolation and prediction of requirements we can 
have good management and timely execution; we can 
probably cause whole lines of depots to fall. We 
can save the huge quantities of stocks consumed in 
just filling the pipeline. 

We have great hopes for the data processing 
possibilities ahead for us in a field army - in 
material accounting and in personnel and opera- 
tional accounting of all kinds. 

Now, having sketched out for you the larger 
picture, the field array and the whole scheme of 
computers we plan for it from the large scale to 
small scale, let me return to my own field and 
sketch out what we see in - "COMPUTERS FOR ARTIL- 
LERY". 

Computers in the Field Artillery 

The Field Artillery supports the other two 
ground gaining combat arras - Infantry and Armor - 
and is in turn supported by the remaining arras and 
services. Therefore, by logical extension of my 
remarks you can readily derive from an understand- 
ing of the artillery* s computer problems an under- 
standing of the place of cosqputers in the full 
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field army* 

Among the combat aims - Infantry, Armor, and 
Artillery - the Artillery holds a unique place 
with respect to automatic digital computing. The 
Artillery stands at that point in the field army 
at which technical, mathematical computing meets 
automatic data processing - the instrument of 
logistics, battlefield management, and command, 
(Figure 1). 

Requirements for Mathematical Computing 

First, we will look at the Artillery * 3 re- 
quirements for mathematical computing. Second, 
we will glance briefly at the type of computers 
we have in mind and a possible scheme of loca- 
tion; third, we will look at the Artillery* s re- 
quirements in data processing; and finally, we 
shall look at a portion of the complete system 
within the Field Artillery as it might look a few 
years hence. 

Our requirement for technical, mathematical 
computing falls into three fields: survey (where 
are we on the ground?); meteorology (how strong 
is that head wind?); and firing data (at what an- 
gle shall we point these cannons and rockets?), 
(Figure 2), 

Finding the answer to each of these questions 
presents a straightforward mathematical problem. 
Survey data and meteorological data are normally 
furnished to the artillery battalion by higher 
headquarters, and so will be discussed when we 
come to a discussion of the complete artillery 
system of computers. However, fire control data, 
based on survey and meteorological inputs, is 
computed at the artillery battalion and battery - 
down where the howitzers, guns, and rockets can 
be found. We will begin at the forward end and 
discuss fire control. 

At this point I think I must take a minute 
to define briefly the organizational structure of 
the artillery so that we may use three simple 
terms - battery, battalion, and division artil- 
lery for the rest of the paper. The basic firing 
unit in the Field Artillery is the battery, which 
we can say includes 6 howitzers (guns) and is re- 
presented by the symbol shown in Figure 3 * 

The next larget unit is the Field Artillery 
battalion which includes two or three batteries 
and is shown in figure 4* 

And finally, the third and last unit we need 
to consider is the division artillery, A divi- 
sion artillery will control a number of bat- 
talions, say five or six. Figure 5 shows the 
organization of a division artillery. 

Fire Control Computer 

What computers are we planning for our math- 
ematical requirements and where will we place 
them? For use within batteries and battalions 
and primarily in the production of firing data we 


are now testing a small ruggedized computer, 

FADAC, Field Artillery Data Automatic Computer. 
FADAC has a rotating memory in the 4096 word 
class. The computer is designed to give us a com- 
plete ballistic solution (integration of half a 
score of differential terms representing the in- 
teracting factors of angle of departure, velocity, 
mass, weather, and so on) in less than the time of 
flight of the projectile. We expect great ac- 
curacy from the solution because the program com- 
bines the best of proving ground methods (full 
mathematical solution to the equations of motion) 
and the best of practical experience. The compu- 
ter will store, and constantly update, the empiri- 
cally derived difference between predicted and 
proven results. The end result, we hope, will be 
first-round accuracy for our cannons and improved 
accuracy for rockets. It is also adaptable to 
other problems and can be termed a general purpose 
computer. For all of our artillery computers we 
intend to have removable and changeable labels on 
our operating panels so that the soldier operator 
can talk exclusively in terms of his own problem, 
Reading in a new program is accompanied by at- 
taching a new set of labels for the operator* s 
buttons. We thus have the advantages of being 
special purpose at the moment and general purpose 
through the day. 

Response To Fire Requests 

Tou must bear in mind that in ground combat 
we normally fire cannon as area weapons. That is, 
we mass the fire of a dozen or more cannon on 
small areas (100 meter, 200 meter circles). By 
first-round accuracy we then mean that the impact 
pattern of the first volley fired by the battery 
or battalion will include the target and produce 
lethal effect against it. (Figure 6 and 7 )* 

Let us accept that immediate, accurate re- 
sponse to a requirement to fire is the Field 
Artillery’s primary interest in computers. How 
does this affect the development of a computer 
system? We can return to the forward observer 
we met at the beginning of this talk for an answer 
to the question. 

First let us look at the sequence in the com- 
plete process of firing and then consider the 
actors who are responsible for action at each 
step. In Figure 8 are the elements of the pro- 
blem. Here is an enemy target; the target is seen 
by an observer; the observer requests the bat- 
talion to take an action; the battalion provides 
an officer who makes a decision; the decision as- 
signs the firing to one or more batteries; data is 
computed; and the batteries fire. 

All of these actions can be speeded by their 
inclusion in a complete computer system, A com- 
plete system can help us avoid error. However, 
only the central computer is indispensible to the 
achievement of first-round accuracy; we must have 
a computer for the mathematics of the ballistic 
solution. 

An Automatic Fire Request/Response System 
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Let us look again at the process, with auto- 
matic devices in operation throughout. Figure 9 
shows the full span. First is the observer* s re- 
quest for fire. The observer uses an automatic 
device to report the location of the target. 
(Marked "Message Entry 41 on Figure 10). He could 
use the telephone or radio with only a small 
price in speed (and, of course, a risk of human 
error in number transcription - coordinate loca- 
tion of the target, and so on). Second is the 
decision and order to fire. We receive the mis- 
sion and display it electronically on a device at 
the battalion which I shall call an electronic 
map and switch device (Figure 31). 

The configuration of this device is under 
study. Although the display is automatic, human 
decision (to fire or not to fire, and how much) 
remains with the gunnery officer. Transmission 
of the gunnery officer* s commands to the firing 
battery is automatic. Removing the automatic 
aspects and returning to inspection of a paper 
map (followed by telephone or radio to the bat- 
tery to fire) would cost in time but not accuracy 
(discounting transcription error). And finally, 
there is at the battery a display unit on which 
can be displayed a complete set of fire commands 
and data for the guns. The complete set includes 
gunnery officer instructions to the battery, and 
the output of the computer located at the bat- 
tery or battalion, or both. With only a small 
loss in speed (and again, possible human error) 
instructions and data can go to the guns by voice 
(telephone or radio). We retain accuracy as long 
as the data is generated by computer. 

Recapitulating, we can say that without loss 
in first-round accuracy we can delete from the 
system the observers message entry device, the 
electronic map, and switch device, and the bat- 
tery display unit. We believe in the possibili- 
ties of a complete system; but our greatest tac- 
tical gain - accuracy - can come from a decen- 
tralized array of ballistic data computers not 
linked by data transmission systems (other than 
the old fashioned kind developed by Alexander 
Graham Bell). 

We wish to play the game both ways. We want 
the benefits of a complete system if we can have 
them; but knowing the uncertainties of the bat- 
tlefield we wish small computers well forward - 
in batteries and battalions - to compute firing 
data. This concept of being able to work our 
computers either in a system or independently we 
call the federal-state approach. 

Tmpllca.tions Of Increased Accuracy 

There are a number of significant implica- 
tions in a more accurate solution to the cannon 
and rocket fire control problem. The first is 
that the Artillery will be able to fire many more 
"immediate fire for effect" missions, missions 
in which no adjustment by the observer is re- 
quired even though no recent registration ("zero- 
ing in") has been accomplished. The obvious gain 
in effect on the target is perhaps matched by the 


time and ammunition saved. Observers can fire 
many more missions in a given period of time. 

A reduction in the number of registrations 
(a procedure for determining corrections of the 
moment) will bring a number of benefits. Logis- 
tically we will save in the ammunition moved for- 
ward. How many thousands of rounds were consumed 
in Korea and World War II in registering bat- 
teries and re-registering them repeatedly? We 
know the cost of one round of ammunition at the 
factory in Virginia or Ohio. But what is the 
cost of that round delivered to the 38th parallel 
in Korea? I do not mean to imply that registra- 
tion will not be necessary at times; but the re- 
quirement should be materially reduced since the 
computer gives us a pure ballistic solution in 
which errors are removed or isolated to a degree 
not now possible using the approximations present- 
ed in firing tables. 

Fewer registrations will mean more surprise. 
Units can move into new positions and be ready to 
render effective support without that tell-tale 
pre-firing registration procedure. Tactical 
mobility will be increased. In the past, impend- 
ing darkness and other restrictions on registra- 
tion in a new area have often influenced the dis- 
placement of Artillery units. Time will be saved; 
registrations can be time consuming. Under pre- 
sent techniques missions sometimes have to wait 
or registrations have to be interrupted. 

Unobserved Fires 

In addition to the response to requests for 
fires called in by observers, the Field Artillery 
frequently has to place fires on areas not under 
the observation of a trained artilleryman. The 
fires may be on call from supported units or may 
be part of a predetermined schedule of fires. In 
any event the utmost in accuracy is required. In 
the past this accuracy has not been easy to 
achieve. The normal procedure is to record basic 
data for all targets and update the data every 
two hours as new meteorological information is 
received. The bookkeeping can become extensive. 
The pressure of events can make it impossible for 
fire direction personnel to find time to update 
data for their on-call fires; and even when 
manual procedures are perfectly executed the re- 
sults are limited to firing table accuracy. 

With a fire control computer we will be able 
to store the location of many targets. Basic data 
for fire on the targets can be generated. Cor- 
rection factors determined from firing can be ap- 
plied to new conditions. Receipt of data on new- 
ly determined meteorological conditions will be 
followed by an immediate updating of fire commands 
for all on call fires. The battalion will be able 
to repeat fires on old targets or place them on 
new targets nearly as rapidly as the guns can be 
laid (aimed). 

The handling of relatively large target lists 
is a problem in data processing and fire planning. 
I should like to defer discussion of these 
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topics until later and complete the treatment of 
technical, mathematical computing requirements* 

Missile Computers 

Missile fire control also presents an ob- 
vious application for high speed computers* One 
missile fire control computer - the Jukebox for 
Redstone - has been delivered and is in the hands 
of operational units* The Jukebox computes, in 
five minutes, the complex mathematical problem 
requiring two trained operators working with 
electric desk calculators two and one half to 
three hours to complete by hand. In the case of 
missile computers the gain is in speed* The hand 
solution produces accurate results for the Red- 
stone but is intolerably slow. Every missile 
type will have to be assigned an appropriate fire 
control computer. Because all computers are po- 
tentially general purpose machines, the same com- 
puter that computes the fire control problem for 
one missile can be reprogrammed to compute the 
problem for another. There are technical limita- 
tions in some cases, of course. But there will 
inevitably be a trend toward standardization and 
limitation of types of digital computers for the 
Artillery. 

Topographic Survey Computing 

Topographic surveys in the battle zone are 
accomplished by the combined effort of survey 
teams at successive organization levels. The 
battalion has a survey function, the division 
Artillery has a broader function, and so on* Ac- 
curate and rapid exchange of information between 
the survey echelons is imperative* Therefore the 
Artillery survey requirement presents a problem 
in both straightforward mathematical computing 
and in data processing* 

The end product of the Artillery survey sys- 
tem is the precise location in three dimensions 
of points in the areas of the Artillery battal- 
ions, and the target area, plus the establishment 
of a reference direction* To reach this end the 
survey system must build a network of computa- 
tions and establish a set of centers for exchange 
of data* There is currently under development a 
computer-supported system for this task. Regard- 
less of whether the complete integrated task can 
be performed automatically in the near future, 
isolated computers at the appropriate various 
levels, supported by conventional communications, 
the “federal-state approach”, will speed the pro- 
duction of data required for accurate artillery 
fire. 

We are studying message entry schemes and 
believe that the battalion survey requirements 
might be met by a message entry device having 
access to the battalion computer in the battal- 
ion operation section, the fire direction cen- 
ter; or it might be better to allocate a compu- 
ter to survey at battalion under some conditions* 
In any event there could be associated with the 
battalion computer in the fire direction center, 
a survey panel for the processing of survey pro- 
blems when the scope of these exceeds the capa- 
bility of the fire control panel. At division 


artillery level it is possible that the survey 
requirement, with particular reference to the 
work of advance parties during displacements, will 
be such that a small computer individually vehicu- 
larly mounted must be assigned to survey. This 
computer would itself be served by both message 
entry devices in the hands of survey parties and 
by the survey control panel on the computer. The 
«mAl1 survey computer would, of course, have ac- 
cess to the medium size computer at division 
artillery headquarters. 

Meteorological Data For Fire Control 

The preparation of meteorological data for 
the artillery has been developed and refined over 
the years. Yet, the timely preparation of the 
required data by hand necessitates the use of 
graphic methods and approximations that cannot 
possibly give the precise results to be expected 
from a high speed machine. The application of 
computers to generation of meteorological mes- 
sages for artillery fire control is an immediate 
prospect. The dissemination of the results auto- 
matically through the system is the obvious corol- 
lary. 

Meteorological data is generated at division 
artillery and higher headquarters* The “met” sec- 
tions, almost alone among our Artillery elements, 
have an on line computing problem with a real-time 
consideration* (There are radar tracking problems 
with a real-time aspect). We track a rising bal- 
loon which is constantly transmitting to the 
ground station* We can solve this problem either 
by the assignment of a small computer to the metro 
section (which may wish to locate at a point well 
away from division artillery headquarters), or we 
can use real-time registers and interrupt features 
on a medium scale computer at division artillery, 
or We can digitize the data from the analog in- 
put at the met station and transmit it to the 
medium scale computer at division artillery. 

The computer requirement for mathematical 
computing in a division artillery then looks 
something like what is shown in Figure 12. 

Data Processing Requirements 

What are some of the Field Artillery appli- 
cations in the data processing field? Many of 
the applications are straightforward, business 
accounting type problems, challenging because of 
their magnitude and because of the requirement to 
bring together the input from many sources. 

Other applications are less straightforward - or 
at least less obvious in their outline - and are 
therefore even more challenging to the artillery- 
man concerned with problem formulation and analy- 
sis. In the second group, there is particularly 
fire planning. 

Fire Planning by Computer 

The Artillery and Missile School has a pro- 
gram which can be placed on the commercial model 
computer at Fort Sill (a Bendix G-15D) to pro- 
duce a schedule of non-nuclear fires at division 



213 

7.3 


artillery level. For example, one problem we run 
plans the fires of 21 batteries against 97 tar- 
gets in execution of a schedule of firing that 
occupies the batteries for one hour. 

Schedule of Fires 

This is a first bite into a larger problem. 
Fire planning includes much more than producing 
a fixed schedule of fires. However, the Artil- 
lery School’s program works; it demonstrates that 
fire planning in terms of a fixed schedule of 
fires can be accomplished in a few minutes on a 
small tactical type computer. The implications 
are enormous. If the laborious process of fire 
planning can be reduced to an operation requiring 
on the order of ten minutes or less, the reaction 
time of large units can be drastically reduced. 
This may materially change the tactics of divi- 
sions and larger units. 

When visitors to Fort Sill are first shown 
the fire planning program they frequently display 
certain incredulity. The objection is us ually 
made that there is either some hocus-pocus about 
the operation or the results are poor indeed; for 
no machine can have the battle experience (and 
judgment derived therefrom) that we can expect 
from consnanders and staff officers. 

The objection is valid - to a point. How- 
ever, it breaks down for two reasons , First, 
many of the operations in fire planning that we 
view as the exercise of judgment involve not 
judgment but routine response to a routine situ- 
ation, And second, the machine can be made to 
accept human judgment in a number of ways: in 
establishing initial criteria; in modifying 
criteria in response to human direction when in- 
termediate answers are presented to the operator 
for inspection; or through processes that, with a 
slight stretching of terms, can be called self 
education of the computer. 

Judgment In Fire Planning 

Consider the first point - that we often call 
the exercise of judgment what is in fact merely 
a standardized response to a standardized situ- 
ation, For example, a fire plan to support a 
division Artillery non-nuclear schedule of fires 
is being prepared. The staff officer scheduling 
the fires of the available Artillery battalions 
has run down his target list and has come to a 
target labelled "machine gun". Map coordinates 
are listed, recent intelligence indicates the 
weapon is most probably where shown; the weapon 
is on high ground and in the general line of ad- 
vance of the attacking force. The staff officer 
produces response A; that is, he schedules a cer- 
tain caliber and volume of fire against the en- 
emy weapon, 

A moment later the staff officer comes to 
another item on the target list that meets the 
same general description. The planner again has 
response A. There may be a difference between 
the two enemy weapon positions; but by the time 


fire is planned this difference has left the 
intelligence picture. The difference may have 
been too subtle to permit reporting in the ac- 
ceptably brief report procedures. It probably 
follows that the difference between the two wea- 
pons was also immaterial from a tactical point of 
view. The point is that the staff officer fire- 
planner has a standard reaction to a standard 
piece of intelligence. Much of fire scheduling 
is along these lines. Many officers drawing up 
schedules of fire have had only a general view 
of the terrain. 

Admittedly, there is quite a difference be- 
tween a machine gun in the line of advance (or 
with a field of fire covering the line of ad- 
vance) and a gun answering precisely the same 
description but located at a point too distant 
to permit it to fire on the attack element. But 
this is geometry and can readily be resolved by 
the machine. 

The objection may be that the computer’ s re- 
action is too rigid, no room is permitted for 
variations in plan, situation, or the many impon- 
derables of the battlefield. The answer is that 
more refinements will be built into our program 
as thought is given to them; and, perhaps more 
important, the program can permit modifications 
of criteria by the commander at any time. A 
particular enemy is particularly resistant to pre- 
viously accepted standard quantities and types of 
fire, Very well, the conmander directs that the 
standard reaction to a particular type of target 
be increased, doubled, or changed in degree in 
any way the battle experience and judgment of the 
commander or his staff suggest. 

Determining Who Should Hit 

The fire scheduling problem is composed of 
two phases. First, the machine must determine 
which units can hit which targets; then the 
machine must determine which units should hit 
which targets. Determining who CAN hit is 
straightforward mathematics. Locations of gun 
and target are compared, azimuths of fire are con- 
sidered and a tabulation of the results stored. 
Considering one target at a time the machine ex- 
amines each firing unit in the appropriate cate- 
gory that can hit that target and determines if 
the desired effect can be achieved by assigning 
the unit to fire on the target. Successively, 
acceptable units are assigned until the desired 
level of damage can be predicted. This is the 
who SHOULD hit phase. It presents the more sub- 
tle and interesting problem. 

Inputs r "Target Word”: "Fire Unit Word" 

To begin the problem we present the computer 
two sets of organized inputs - data with respect 
to the targets, known as the "target word"; and 
data with respect to the fire units, known as the 
"fire unit word," The conmander’ s criteria are 
set in as constants and can be changed with each 
plan. 
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Target Word 

The target word consists of ten pieces of 
data, as shown in Figure 13* 

The concentration number is a tag that will 
permit identification of the target on the final 
fire plan. The tag will also permit deleting the 
target from the target list at any time prior to 
computing the fire plan* The coordinates are 
standard military map grid coordinates (a 
cartesian system) and form the basis for the 
determination of CAN hit lists by the computer. 

The next seven items in the target word are 
what permit the blend of routine reaction and 
judgment (or camaander’ s preference) in the 
machine developed plan* The seven items are all 
determining factors in the who SHOULD hit con- 
sideration* The final item, the date time group 
is necessary for intelligence processing* 

Target Types 

If there is to be a standardized reaction to 
targets there must be standardized targets* The 
working group at the Artillery and Missile School 
have developed a standard list of targets com- 
posed of three classes and, within those classes, 
a total of ten types* We consider that all non- 
nuclear targets presented to a ground force for 
planned fires can be categorized as falling with- 
in one of the types of targets shown in figure 
14. 

Total Target Description 

In addition to affixing a type number to 
each target it is assumed that the observer or 
other acquisition source will give some indica- 
tion of the size of the target* An index of the 
magnitude can be conveyed under two convenient 
designators} scale, a measure of physical dimen- 
sions} and density, a measure of the number of 
personnel within the target area* Necessarily, 
both designators must be simple and stated in 
general terms* It has been tentatively decided 
to use two scales, and two densities* Scale 1 
is a target, varying from a virtual point to ap- 
proximately 120 meters in diameter} and scale 2 
is a target of 121 to 240 meters in width, the 
depth remaining at 120 meters* The depths and 
widths are associated with the normal dispersion 
pattern from average battery gun positions for 
the calibers under consideration* Large target 
areas are attacked with conventional artillery 
only after they have been broken down into smal- 
ler targets of the scales mentioned. 

With ten target types, 2 scales and 2 densi- 
ties (density 1 is considered normal, density 2 
a thickly packed target such as an assembly area) 
there is ample opportunity for adequate target 
description for immediate response to requests, 
and for intelligence file purposes. Ten target 
descriptions - more than ample* 


Reliability is a reflection of the assurance 
that the target is in fact as reported. Targets 
are "confirmed” or "suspect". 

Priority. Phase and Repeat 

Priority ha3 been established as 1, 2, or 3 
for the machine. Priority 1 targets are those 
that must be hit or the fire plan is considered 
unacceptable and the machine halt 3, awaiting new 
instructions. Before halting, the machine at- 
tempts to shift the center line on which the bat- 
talions and batteries are laid to see if a slight 
change will increase the unit’s capabilities. 
Priority 3 targets are those that can be most 
readily discarded* This leaves, by forced defini- 
tion, priority 2 targets as those that should be 
hit if possible, and should not be discarded until 
after priority 3 targets are discarded* Priority 
can be determined by the computer from the other 
parameters. There is provision for override so 
that the local situation and the commander’s de- 
sires may govern* 

The remaining two items in the target word 
determine whether the target is fired on early, 
late^ or in the middle of the schedule of fires} 
and whether or not fire is repeated on the target. 
Each target is tagged with the desired action as 
it is entered into the machine* If the commander 
wishes to make it a matter of standard procedure 
that certain types of targets are always fired on 
in a certain phase of the fire plan, this can be 
built into the program. The time required to 
input target data will be reduced accordingly* 

The same applies to the repeat requirement. Fire 
can be repeated once, twice, or, in long plans, 
three times* 

Fire Unit Word 

It is also necessary to furnish the computer 
data on the fire units. The "fire unit word" con- 
sists of six parts* I shall not discuss them in 
detail as the purpose of each piece of data is 
fairly obvious* The fire unit word is shown in 
Figure 15. 

Supporting the Fire Plan Formulator 

The Fire Plan Formulator, the name given our 
program, modified and revised as it will be at 
the Artillery and Missile School, will produce a 
fire plan in less than a quarter of an hour - per- 
haps in as little as 5 minutes on a smal l , high 
speed tactical machine. The Fire Plan Formulator 
needs to be supported by devices and procedures 
that work together as a system to provide an up- 
dated target list from which fire plans can be 
made on call* The Formulator also needs the sup- 
port of devices and procedures that will take a 
finished fire plan and disseminate it to the many 
points on the battlefield where it will be used. 

Such an integrated scheme of devices and 
procedures is in prospect for the Artillery. It 
is part of the Army’s Automatic Data Processing 
System* The Army’ s effort to develop an automa- 



tic data system for the field army (ADPS) is be- 
ing especially directed toward the Field Artil- 
lery subsystem which will be first subsystem 
developed* 

The Artillery Subsystem for AJDPS 

The U. S. Army Electronic Proving Ground, 

Fort Huachuca, Arizona is responsible for execu- 
ting the analysis and programming of the full ADP 
system. The Field Artillery is actively support- 
ing development of the Artillery subsystem. 

In Figure 16 I have shown part of the Artil- 
lery subsystem up through division artillery as 
it may appear when allocations of computers are 
ultimately determined. Recall that a division is 
a body of men in the 10,000 to 15,000 class. The 
Division Artillery consists of six battalions and 
a Division Artillery Headquarters, A possible 
computer system for one battalion and the divi- 
sion Artillery is shown in Figure 16, 

Note again our familiar forward observer, 
the battalion fire direction center, and the 
filing batteries in which are located the guns; 
notice the survey computer; and the division 
Artillery computer. 

To perform the fire data computation (bal- 
listic solution) which was discussed in the be- 
ginning of the paper we have the computer in the 
battalion - here. We also plan computers for 
fire control at the batteries. But now we have 
discussed fire planning and we have implied other 
applications, ammunition accounting, target list 
updating, survey, and others. The central pro- 
cessor (computer) at battalion must be fortified. 

We will probably require memory increments 
or augmentation in same form to raise the capa- 
city of the memory to 8 or possibly 12,000 words. 
In the forward areas such as battalion we do not 
believe that we can ever work with the fragility 
inherent in magnetic tape drives as we now know 
them. Memory augmentation might be through the 
use of special ruggedized canisters carrying end- 
less loop magnetic tapes; or possibly the time 
requirement for program change will be sufficient- 
ly generous to permit the use of endless loop 
paper tape canisters. 

We have under development for the ADP sys- 
tem, small core memory computers (known as COMP AC) 
which have a capability for the addition of mem- 
ory modules in units of 4096 words at a time. 

This may be an unnecessarily costly means of 
memory augmentation for us. Just what computers 
and peripheral equipment will be used in the 
small units cannot be stated with any certainty 
at this time. 

There is one interesting feature that we 
have introduced here and that is the multiple 
console concept. Around a single central pro- 
cessor (core memory, parallel computer) we have 
three consoles, or operating panels. The panels 
are labelled with the precise problem parameters 
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of "fire control", "operations", and "survey** so 
that our soldier computer operators can be skil- 
led artillery men first, and computer operators 
second. We cannot afford long periods of train- 
ing for computer operators. A real time inter- 
rupt feature on CQMPAC permits apparent multiple 
entry to the computer. 

At the division artillery we have a medium 
scale computer known as the BASICPAC. This com- 
puter has a core memory of 4096 words which is 
readily expandable by the addition of memory 
modules of 4096 each. In addition to the 4 to 
8 standard magnetic tapes available to it, the 
machine will handle half a dozen input/output de- 
vices. The computer is further fortified and 
brought to the medium class by the numerous in- 
dex and real time registers. 

A Complete System 

Here we begin to have the heart of a true, 
relatively full computer system. The division 
artillery computer can query and respond to 
querries of the six battalion computers and to 
the survey computer working directly out of the 
Division Artillery Headquarters. The division 
artillery computer receives data for computation 
from the met station, or it receives for retrans- 
mission a met message computed at the met sta- 
tion. Our system now gives us survey, and 
meteorology - the full set of mathematical re- 
quirements less fire control which i3 accomplish- 
ed at lower levels. The BASIGPAC also gives us a 
capability for fire planning, target list pro- 
cessing, intelligence functions, anmunition ac- 
counting, march planning, and numerous adminis- 
trative and logistical functions. 

We now begin to look like part of a system. 
BASICPAC canmunicates with the Division Head- 
quarters, with each of its own six battalions, 
and with the next higher Artillery Headquarters 
(known as Corps Artillery). Unquestionably, the 
system will give us great power. 

As you can see, we in the Artillery have our 
work cut out for us in keeping one jump ahead of 
the hardware, in analyzing our problems, and in 
planning the integration of the data handling of 
each problem into our system. We understand and 
appreciate the power of this great new weapon. 

It is up to us to do some hard thinking and pro- 
duce the best possible methods for using it. We 
intend to try. 
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Among the combat arms 
—Infantry, Armor, and Artillery— 
the Artillery holds a unique place with 
respect to automatic digital computing. 
The Artillery stands at that point in the 
field army at which technical mathemat 
ical computing meets automatic data 
processing— the instrument of logistics, 
battlefield management, and command. 

Figure 1 


NEED MATHEMATICAL 

COMPUTING 


SURVEY (Where are we?) 

METEOROLOGY (How strong is the 

wind?) 

FIRING DATA (At what angle shall 

we point our cannon 
and rockets?) 


Figure 2 
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COMPUTERS IN FIRE CONTROL 



Figure 9 
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Figure 12 


TARGET WORD 

1. CONCENTRATION NUMBER 

2. COORDINATES 

3. TYPE 

4. SCALE 

5. DENSITY 

6. RELIABILITY 

7 . PRIORITY 

8. PHASE 

9. REPEAT 

10. DATE/TIME 


Figure 13 




TARGETS 

CLASS TYPE 

I - PERSONNEL IN OPEN 0. INFANTRY IN OPEN 

1. OP IN OPEN 

H - PERSONNEL IN SHELTER 2. INFANTRY DUG IN 

3. COMMAND POSTS 

4. INFANTRY WEAPONS 

5. OP DUG IN 

IE - PERSONNEL/MATERIEL 6. MORTARS 

7. ARTILLERY 

8. ARMOR 

9. SUPPLY DUMPS 

Figure lit 


FIRE UNIT WORD 




UNIT NUMBER 
COORDINATES 
AZIMUTH OF CENTER LINE 




CALIBER 

NUMBER OF BATTERIES 
MISSION 


Figure 1 5 











225 

8.1 


COMMUNICATIONS WITHIN A POLYMORPHIC INTELLECTRONIC SYSTEM 

George P. West, Ralph J. Koerner 
Intellectronics Laboratories, Ramo- Wooldridge 
Canoga Park, California 


Summa ry 

A multiple -computer system, employing several 
computers of intermediate capacity, provides 
many system advantages over conventional large- 
scale computing systems. A multi -computer sys- 
tem requires effective communication of data and 
control signals between modules of the system. 
This paper briefly describes the RW-400 Data 
Processing Central, a new polymorphic data proc- 
essing system. The communications problems 
associated with such a system are discussed, and 
a high-speed electronic switch employing multiple - 
aperture magnetic cores to provide system com- 
munication is described. The utilization of this 
high-speed switch in providing efficient data com- 
munication, as well as effective control of the sys- 
tem modules, is explained. 


With increased automation of large-scale data 
processing applications, the trend has been to 
larger memories, higher clock rates, andincreas- 
ingly complex arithmetic units. This approach is 
not entirely satisfactory, particularly from cost 
and reliability standpoints. A multiple -computer 
approach employing several computers of inter- 
mediate capacity provides many system advantages . 
A novel organization of more or less conventional 
computational elements, communicating in a com- 
mon format by means of a high-speed electronic 
switching network, provides a data processing 
system which can assume many different forms. 
Such a polymorphic system can be more nearly 
tailored to the requirements of a particular appli- 
cation, since storage capacity and arithmetic capa- 
bility can be adjusted independently, in relatively 
small increments to match the requirements at 
hand. Furthermore, changing operational require- 
ments can be matched by expanding the system 
economically, without obsolescence of existing 
equipment or programs, merely by adding addi- 
tional computers or memory devices. Finally, 
the system can be programmed to adjust its own 
performance to meet rapidly changing operational 
requirements by shifting computational capacity 
from one assignment to another. 

The RW-400 system is a multiple -computer 
system of this type, capable of simultaneously 
processing multiple tasks with extremely short 
time requirements for the completion of each task. 
The Computer Modules of the RW-400 system em- 
ploy a powerful two-address order code providing 
three optional forms of each arithmetic instruction. 
These options may be utilized to specify the con- 
tents of the accumulator as one of the operands, 
providing in effect a two-plus address structure. 
A magnetic core memory of 1024 -word capacity, 
and 10-microsecond memory cycle, is provided 
with each Computer. Control and synchronization 
of computations is exercised by a generalized pro- 


gram interrupt feature. Each Computer is capable 
of recognizing one or more external interrupt con- 
ditions to force a branch in the computer program. 
Each Computer is provided with a masking regis- 
ter which can be loaded by the computer program 
to specify which of the 13 possible external inter- 
rupt conditions, as well as which of the internal 
interrupt conditions, will be effective at any in- 
stant of time. 

The system employs novel magnetic core 
buffer memories (Buffer Modules) to augment the 
memory provided with each Computer Module. 
The Buffer Modules make possible rapid, effi- 
cient transferral of data and programs within the 
system. Each Buffer Module is capable of inde- 
pendently executing a stored program which 
permits it to gather data from magnetic tapes, 
drums, or the system input-output buffers. Simi- 
larly, the Buffer Module can distribute results 
under the control of its own stored program. The 
Buffer Module may also be used to augment the 
memory of any of the Computer Modules; in this 
mode, operands taken from the Buffer memory 
may be utilized by the Computer Module almost 
as rapidly as operands from the Computer's in- 
ternal memory. 

The system employs magnetic tapes, mag- 
netic drums, printers, plotters, and input- output 
buffers as independent modules. All of these 
modules communicate with one another by means 
of a high-speed electronic switch, similar in 
many respects to a telephone exchange. Figure 1 
illustrates a typical system configuration. The 
Central Switching Exchange can be tailored to 
the needs of the system, and may easily be ex- 
panded to accommodate system growth. The 
remainder of the paper discusses the design ob- 
jectives for an adequate central switch, and de- 
velopes a switch design which fulfills these design 
objectives . 

The data switch must provide multiple paths 
so that each Computer can communicate without 
delay or interference. Control of the switch points 
must be rapid (less than 100 microseconds) and 
convenient so that Computer time is not wasted in 
manipulating the switch. The switch must test for 
a busy condition before making the connection, to 
prevent interference with communications in prog- 
ress. The switch should be reliable, providing 
alternate paths for communication so that failure 
of a cross-point does not disable the system. The 
switch should be modular to provide for system 
growth. Since new problems must be checked out 
while the system is in normal operation, a means 
of restricting the accessibility of modules must be 
incorporatedin the design. For example, in check- 
ing out a program that involves a Computer, a 
Buffer and two Tape Modules, it is necessary to 
restrict the switching capability so that the Com- 
puter and Buffer can control connections only be- 
tween the modules being code-checked. This feature 
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prevents the new program from interfering with 
normal system operation. 

The RW-400 data switch employs transfluxors 
(double -aperture cores) as the switching element. 
The principle of operation is analogous to that of 
a transformer connection which can be disabled by 
saturating the iron to prevent the flux linkage of 
normal transformer action. Some of the desirable 
properties of the transfluxor switching element 
are extreme reliability, small size, low cost, low 
power consumption, connection memory, ease of 
control, and bilateral signal flow characteristics. 
Figure 2 is a diagram of a transfluxor crossbar 
s witch. 1 In this array, the transfluxors are se- 
lected by coincident currents. 

Assume a "connection" to be made and signal 
flow to be taking place between Signal Input 2 (S^) 
and Signal Output 3 (O 3 ) through core 10. Signal 
flow consists of a train of alternately positive and 
negative pulses of current along S£ which are in- 
ductively coupled by core 10 to O 3 ; O 3 in turn is 
attached to the load. 

To "disconnect" or open the path between S^ 
and O 3 , a blocking pulse of current is introduced 
along blocking line B 3 . This blocking pulse blocks 
allpossible signal paths for CL, specifically cores 
9 , 10 , 11 , and U. 3 

To establish a new connection, between S 3 and 
O 3 for example, core 11 must be unblocked. This 
unblocking is accomplishedby passing coincident 
unblock currents through lines UB 3 and B 3 . 

A transfluxor plane is provided for each signal 
line required between the computation modules. 
The RW-400 switch provides separate line groups 
employing 18 lines for communication in each di- 
rection. Consequently, the RW-400 switch employs 
36 transfluxor planes. Control of the switch is 
obtained by providing an input commutator which 
samples each of the active input lines in turn. The 
commutator position controls one axis of the coin- 
cident-current selection, while the other axis is 
specified by a Command Output instruction from 
the Computer or Buffer being sampled by the com- 
mutator. An additional transfluxor plane is pro- 
vided to record the status of each cross-point. In 
this way, it is possible to test the additional plane 
to determine whether the connection requested is 
available or busy. If the device requested is busy, 
the connection request is rejected without inter- 
fering with the communication already in progress . 
This basic switch configuration meets all of the 
design objectives except two, which are discussed 
below. 

Control of the switch is rapid. Since the com- 
mutator speed depends upon the number of connec- 
tion requests which must be serviced, the service 
time for a switch instruction will vary. Average 
service time is estimated to be about 50 micro- 
seconds. The switch tests for a busy condition 
before making a connection. Magnetic switching 
elements promise high reliability. Alternate paths 
are available by employing Buffer Modules in the 
transfer. The switch is modular in the sense that 
all circuits for a given set of input or output lines 
are packaged on a single 19 -inch insert card. A 
cabinet with power supplies and all the transfluxor 
planes forms the basic switch. As additional mod- 


ules are required in the system, line circuits are 
added as required. Line circuits constitute about 
one -half of the cost of the switch. 

One of the design requirements which is not 
met by this simple design is the ability to restrict 
the setting of s ome of the cros s -points . This design 
requirement to inhibit certain connections could 
be met by adding another transfluxor plane in which 
allowed cross-points would be recorded by block- 
ing or unblocking the transfluxor corresponding to 
the cross-point. The memory plane would then be 
tested to see if the connection is allowed, in the 
same way that a test for a busy circuit is made. 

The other design requirement which is not met 
is harder to rectify with this simple design. Con- 
trol of the cross-points of the switch in this design 
is not particularly convenient, since the addresses 
used in the switching commands must correspond 
to the physical location on the switch matrix of the 
module selected. A design which would permit the 
programmer the use of arbitrary tags in his pro- 
gram, for the modules he intends to use, would be 
much more useful. When the program is complete, 
the tags used by the programmer could be entered 
in a memory in the switch before the program is 
executed. In this way, the switch could properly 
interpret the tags and select the desired module. 
In other words, a system of symbolic addressing 
for the switch would be much more convenient than 
an absolute addressing scheme. A very rapid table 
look-up operation is necessary to interpret the 
symbolic addresses within the switch. 

A transfluxor memory device , called an Inter- 
rogation Module, which performs this table look-up 
very rapidly, has been developedby Ralph Koerner 
of Ramo -Wooldridge. The transfluxor memory em- 
ploys two cores to store each bit position, one for 
each of the two possible states of the bit. The bits 
of the words which constitute the table are entered 
in parallel along one axis of the memory plane, 
while word positions along the other axis of the 
plane are designated by a read-in signal. Figure 3 
illustrates the principle of operation of a three-bit 
Interrogation Module. The normal or clear posi- 
tion would have all cores unblocked. Words are 
entered into the memory in such a way that a logical 
"one" unblocks the "one" core andblocks the "zero" 
core of the pair of cores associated with the bit 
position, while a logical "zero" blocks the "one" 
core and unblocks the "zero" core. If, now, the 
complement of a number is used to interrogate the 
memory along the vertical lines threading all word 
positions, all of the output lines will have signals 
coupled to them by the transfluxors except that line 
corresponding to the memory position which stores 
the word exactly matching the word at the input 
of the interrogation unit. For that word only, the 
input signals will all traverse blocked cores , since 
the stored word and the interrogating word are 
complements. This Interrogation Module can be 
incorporated in the design of the switch to provide 
symbolic addressing; but even more important, it 
greatly simplifies the problem of interrupt com- 
munication within the system. 

Interrupt communication is necessary to co- 
ordinate, synchronize, and control the operations 
of the individual Computers to provide the best 



overall performance. The interrupt feature is a 
means of forcing a branch in the computer program 
by the use of an external electric signal such as 
is generated by depressing a push-button. In the 
RW-400 system, most interrupt signals originate 
fromComputer or Buffer Modules; however, other 
devices such as the input-output buffers, provide 
signals which can be utilized’as interrupt signals. 
When interrupted, a Computer must determine the 
meaning of the interrupt signal and its source. Some 
information as to intent or source of the interrupt 
can be conveyed by the bit position in which the 
Computer receives the interrupt. Unless a directed 
system of interrupts is employed, the bit assign- 
ments are rapidly used up specifying which Com- 
puter or Buffer should respond to the signal. 

As an additional means of coordinating system 
actions, a digital clock is provided which can be 
read by any Computer or Buffer. The clock is more 
useful when it generates an interrupt signal at the 
end of a prescribed interval of time, directing it to 
the appropriate Computer or Buffer. The directed 
interrupt is an important system requirement. 

Figure 4 illustrates a switch design which em- 
ploys an Interrogation Module to control the simple 
data switch described earlier. The commutator 
samples the input lines in turn, routing the address 
tag specified by the Command Output instructions 
to the Interrogation Module. In general, the com- 
mutator position determines the selection along one 
axis, while the output of the Interrogation Module 
determines the selection along the other axis. The 
Interrogation Module is provided with additional 
word positions which are used to route interrupt 
signals to specified Computers. Note that the digital 
clock is sampled along with the Computers and Buf- 
fers at the input to the switch. By placing a time 
value in one of the interrupt words of the Interro- 
gation Module, an interrupt is generated when the 
digital clock value equals the pre- stored time. In 
this way, Computer interrupts can be generated at 
specific time intervals. In normal system opera- 
tion, Buffer Modules read or write the tape or drum 
records so that Computers usually accept data from 
Buffer Modules or deliver data to Buffers. For 
this reason, it is convenient to permit a Buffer to 
connect itself to a Computer if the Computer con- 
nection is not busy. When the Buffer stops its self- 
instruction program, interrupting the Computer, 
the Computer can use the results immediately with- 
out loss of time manipulating the switch. This type 
of connection is obtained by assigning either Corn- 
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puters or Buffers entry to both the x- and y-axis 
of the switch. The figure shows Buffers on both 
axes of the switch. If there are fewer Computers 
than Buffers in the system, the Computers would 
receive this double assignrhent soas to conserve the 
number of y-connections available to the system. 

The Interrogation Module provides a ready 
means of restricting accessibility of certain cross - 
points of the switch. The word length is increased 
beyond that required for the selection address, and 
the extra bits are interrogated by the commutator 
position, while the selection address interrogates 
the rest of the word. Selection is inhibited unless 
the correct commutator code is stored in the mem- 
ory of the Interrogation Module. Note that, by 
interrogation with "zero's" on both wires of a bit 
position, no signal is generated at the output, re-’ 
gardless of how the cores are set. In this way, it 
is possible to ignore certain fields of the interro- 
gation word. If the commutator position interro- 
gation occurs using an assigned bit for each com- 
mutator position, more than one device canbe per- 
mitted to control a given switch selection. This is 
accomplished by setting into the commutator field 
of the Interrogation Module the proper bits for that 
selection word. In this way, identical switch ad- 
dress tags may be used concurrently in two differ- 
ent computer programs without confusion, since 
each will be assigned a different commutator bit 
by the Master Computer in setting up the assign- 
ment table. The bit will differentiate between the 
two Computers which are in simultaneous opera- 
tion; thus, the switch can distinguish between the 
two identical address tags by determining which 
Computer is using the tag. 

The use of the Interrogation Module in conjunc- 
tion with the transfluxor switch provides a very 
convenient means of solving the difficult communi- 
cation problem associated with data switching and 
interrupt control within a polymorphic computer 
system. 
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Figure L 

SWITCH DESIGN EMPLOYING INTERROGATION CONTROL 
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ENCODING OP INCOMPLETELY SPECIFIED BOOLEAN MATRICES 

T. A. Dolotta 
E. J. McCluskey, Jr. 

Princeton University 
Princeton, New Jersey 


Summary — The problem discussed in this paper is 
a generalization of the problem of choosing the bi- 
nary codes for the various operations of a digital 
computer with a view towards minimizing the gating 
circuitry in the computer's central control. The 
general procedure is a systematic method for the si- 
multaneous encoding of a large number of arbitrary 
Boolean functions in an economical manner. The 
method is applicable to those cases where the de- 
signer is allowed to choose the independent variables 
so as to minimize the circuit which realizes the 
given Boolean functions. Other applications of the 
procedure are indicated. The method given includes 
a means for determining a lower bound on the cost of 
the circuit which realizes the given Boolean func- 
tions. The entire procedure is systematic and lends 
itself well to mechanization via a digital computer 
program. A discussion of the advantages and limita- 
tions of the method presented here is included, and 
the method is compared with a previously published 
procedure for attacking the same problem. 

Introduction 

In the design of the control part of a digital 
computer, there may often arise the following prob- 
lem: We are given the number of bits reserved for 
the operation code, and also the various control 
signals that have to be generated .for each operation, 
and we want to choose the codes for the various op- 
erations so that the control signals can be obtained 
from the operation codes by a simple and cheap net- 
work. 

We will assume that we have m operations to encode 
with n bits, (of course, m< 2 n ) , and that we have to 
generate p control signals. For each operation, we 
can then specify which control signals must be ON 
(i.e., equal to "1”), which must be OFF (i.e., equal 
to "0"), and which may be either ON or OFF (i.e., 
"don't cares"). We now formally state the problem. 

I . Problem Statement 


lower bound on the cost of synthesizing the p func- 
tions. 

II. Assumptions and Definitions 

1. The number of independent variables, n, has to 

be specified externally, subject to the re- 
striction that 2 n > m. 

2. The n independent variables, denoted by 

x^, x_, ...., x , are available together 
with their comp?ements at no cost. 

3. The p functions F, , ...» F are to be syn- 

thesized from tne independen? variables by 
using 2 stage diode logic only, (inverters 
are not allowed.) 

4. The cost of synthesizing these functions will 

be defined as the number of diodes required 
by the circuit designed under the above 
assumptions. 

5. Given a column, F. , we define the complement of 

F. , denoted by as the column obtained by 
changing all then's in F^ to l's and all the 
l's to 0's. The 0's are not affected. 

6. F i will be called a 0(l) -column if it contains 

no l's (0's). 

7. F^ will be said to be specified in any of its 

m positions if that position contains a 0 or 
a 1. Otherwise (if the position contains a 
0) , Fj_ is said to be unspecified in the par- 
ticular position. 

8. Given two columns, F. and F., there will exist 

the intersection or F. ana F., written as 
F. • F., if and only If F. and F. agree (have 
tne same symbol) whereverboth F* 1 and F. are 
specified. F. • F. (= F . • F ± ) will be J de- 
fined as a column 8f length a which agrees 
with both F^ and F • wherever either is speci- 
fied and contains 0's everywhere else. In 
figure 1, columns A and B have an intersec- 
tion, which is shown as column D; columns A 
and C' also intersect. Columns B and C do 
not intersect. (We write this as B • C = A) 


Assume that we have p columns, F^ to Fp, each 
column containing m symbols. There are 3 allowable 
symbols, namely 0 (zero), 1 (one), and 0 (don't care\ 
We consider each column as a Boolean function of n 
independent variables. If any column, F^ contains 
only zeros and ones, and if 2 n = m, then we will say 
that F. is completely specified . Otherwise, F, is 
incompletely specified . (We do not consider tne 
trivial case where F. is made up entirely of don't 
cares; i.e., is completely unspecified .) Our problem 
is so to choose the n independent variables that the 
p columns can be synthesized from these variables in 
a simple and cheap fashion under a set of fairly 
realistic assumptions. We also want to obtain a 
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Figure 1 


B 


9. Three columns, say F^, Fj, and F^ will have an 
intersection, F^* Fj* Ffc, if and only if 
**i* Fj, Fj_* Fjj, and Fj* F^ all exist. The 
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intersection will agree with all three col- 
umns wherever any of the three is specified. 
The extension to the intersection of any num- 
ber is obvious. 

10. Given two columns, F^ and Fj, F^ will be said 

to include Fj if and only if Fj agrees with 
F^ wherever the latter is specified. We 
write this relation as F^ 3 Fj. In Figure 1, 
we see that A 3 E, and B 3 F' . If F^ 3 Fj, 
we will call F^ the inclu ding column, and Fj 
the included column. It is obvious that F^ 
has at least as many 0 ’s as Fj. 

11. Below we list some of the more obvious and use- 

ful facts which follow from the above defini- 
tions: 


12 . 


13. 


a. If A3 B, and B D A, then A = B, 

b. If A 3 B, and B 3C, then A 3 C. 

c. If A3 B, B 3 C, and C 3 A, then A=B=C. 

d. If A 3 B, and B*C exists, then A»C exists. 

e. If A 3 B, then A ^ B' , unless A is com- 

pletely unspecified. 

f. If A*B exists, then A*B' does not exist, 

unless A or B or both are unspecified 
in every position. 

g. Obviously, if A3 B, then A*B = B. 

h. If A*B = B, then A 3 B. 

i. If A 3 B, then A' 3 B». 

j. If A»B exists, then so does A ,# B', and 

(A*B)' = A’*B». 

k. If A*B exists, let A*B = C; then A 3- C 

and B 3 C. 

For any column, F^, let 

IT q = number of zeros occurring in F^ 

N. , = number of ones occurring in F. 

1,1 V 

N. j = number of 0's occurring in F . 

i,0 i 

Any column, F^, will be called codable if and 
only if 


N. . < 2 
1,0 - 


,n-l 


and N. . < 
1,1 - 


,n-l 


and the seven columns represent seven control sig- 
nals which must be generated from the operation 
codes. Each operation must be represented by a 
three-bit code word. The manner in which the rows 
and columns are numbered is arbitrary. 

We should first examine the matrix row by row to 
insure that no two rows have an intersection. Should 
the above not hold, i.e., should there be two rows 
which have an intersection, then this would mean that 
the two operations corresponding to the two rows are 
not distinct, and could be replaced by one operation. 
In a normal situation, this would not be the ca 3 e.^ 

We note that in Figure 2, there are no two rows which 
have an intersection. 

We now focus our attention on the columns of our 
matrix. We first note that column 7 is a 1-column. 
Therefore we need not worry about encoding it, since 
we can obtain the corresponding control signal by 
using a constant ”1" signal in its place. We there- 
fore discard the 7 th column. 

At this point we examine the remaining six col- 
umns to determine whether there exist any inclusions 
between any two columns. We find that F 5 3 F]_. We 
note that wherever the control signal represented by 
F 5 is needed, we may with impunity replace it by the 
signal corresponding to F^, since F^ is the same as 
F^ wherever F 5 is specified. And since F^ has to be 
synthesized, we needn't synthesize F 5 , but merely use 
F^ in its place. Therefore we discard F^, and con- 
cern ourselves with the remaining five columns, 
namely F]_, F 2 , F 3 , F 4 , and Fg. We call this set of 
five columns Set A. 

We now examine these five columns to determine 
whether there exist intersections between any two of 
these columns. We find that F 2 * Fg exists. Let us 
define Fa = 5*2 * Fg. We observe that if instead of 
encoding F 2 and Fg, we encode F a , then we can replace 
F 2 and Fg by F a , since the latter agrees with both 
F 2 and Fg wherever either is specified, so that we 
will satisfy the original specification. Thus we 
now have only four columns to encode, namely F^, F 5 , 
F 4 , and F a . Figure 3 shows these four columns. 


Otherwise the column will be called not codable. 

14. We assume that constant signals representing 

" 0 ” and " 1 ” are available to us. 

15. Column F. will be said to cover column Fj if 

and only if either Fj 3 F i or Fj3 F^ 

III . Procedure 

The procedure that we will follow in order to 
obtain an encoding of the various operations is prob- 
ably best explained by first considering a concrete 
example. Figure 2 shows a 6 x 7 matrix: the six rows 



Figure 2 



Figure 3 


We should note parenthetically here that in our orig- 
inal matrix (Fig. 2) there also existed the intersec- 
tions F 2 * F$, F 2 • F* f F 4 • F 7 , and Fg • F 7 . All of 
these, however, involve Fc and F 7 which have been 
discarded, and therefore these four intersections are 
of no further interest to us. 

At this point we would like to identify each of 
the four columns in Figure 3 with one of the three 
independent variables or a complement of such a var- 

iThis remark may not apply in other applications 
to which our encoding procedure might be put; and it 
is conceivable that we might want to have two or more 
distinct operations even though one might suffice. 
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iable. Should this be possible, we would then have 
an ideal (or zero cost) encoding in the sense that 
each of the seven original signals could be obtained 
from the independent variables or a constant signal 
without any intervening logical network. If an ideal 
encoding is to be possible, then we must be able to 
identify each one of our gating functions with at 
least one of the independent variables (or its com- 
plement) . An ideal encoding very often is not pos- 
sible for one of two reasons. First of all, in order 
to be able to identify four columns with 3 independ- 
ent variables, we must identify at least two of the 
four columns with one of the independent variables; 
if this is to be possible, then there must exist at 
least one inclusion or intersection relation within 
our four columns. However, we know that this is not 
the case, and therefore at best only three of the 
four columns can be identified with the independent 
variables, and the remaining column will have to be 
synthesized with a logical network. Secondly, if a 
column is to be made into (i.e., identified with) an 
independent variable, then that column must be cod- 
able. This follows from the fact that if we write 
down the 2 n possible n-bit words, we note that each 
bit position contains exactly 2 n- ^ W 1 M 's, and 2 n ”l 
"0" 's. Since in our case n = 3, we know that no 
column which contains more than four ”1" 's or "0” 's 
can be used as an independent variable. We note in 
Figure 3 that column F^ is not codable. We conclude 
on the basis of either or both of the reasons stated 
above that an ideal solution is not possible in our 
example, and that at least one 2-diode gate will be 
needed to synthesize the signal represented by F4. 

We now identify the three remaining columns 
(namely 1, 3, and a — calling them Set B) with the 
three independent variables in any arbitrary fashion. 
All columns which are members of Set A but not of Set 
B constitute Set C ; in this case Set C contains col- 
umn 4. Figure 4 shows such an assignment , with 0 's 
replaced by blanks. 
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Figure 4 


It appears at first that we can choose any of the 
following code words for row 3: 010, Oil, 110, 111. 
However, 010 and Oil are used already by rows 4 and 
5, respectively; our choice is limited to 110 and 111. 
It can be seen that either of these two choices will 
give a valid encoding, in the sense that no code word 
appears more than once. Also, it turns out that 
either choice will give a simple encoding for F4. 
Arbitrarily, we chose 111 as the code word for row 3. 
Figure 5 shows that final encoding, together with the 
expressions for the control signals in terms of the 
independent variables. Of course, we can use the 
three independent variables in any order we wish, and 
we may complement any or all of them, so that there 
are many codes which axe symmetries of our code and 
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which give the same type of solution to this prob- 
lem.^ This fact is especially useful if we want to 
avoid the use of a specific code word (such as an 
all-zero code word) , because it is possible to elim- 
inate such a word by complementing one or more of 
the independent variables whenever m < 2 n . In our 
example, complementing of x^ will result in the 
elimination of the all-zero word. If this is done, 
then of course x^ and xj have to be substituted for 
one another in all the expressions where they occur. 

Ve must realize that when we ascertained that 
columns F^, F*, and F a were all codable, this was only 
a necessary, but not a sufficient condition for these 
columns to make a valid code. In Figure 1, columns 
C, D, and E are all codable, but they do not make a 
valid, code, since the code Oil occurs twice. (For a 
code to be valid , it is necessary and sufficient that 
no code word appear more than once.) 

We see that we have obtained here as cheap a 3-bit 
encoding of our original matrix as is possible, since 
we have in fact met our minimum estimate of cost. 

It should be apparent that this is not always the 
case, and that there may arise other problems when 
we attempt to encode very large matrices. Consider- 
ation of these points as well as a more rigorous and 
complete statement of our procedure will be the sub- 
ject of the next sections. 


IV. General Considerations 


In this section we will discuss some possible 
situations which have to be considered in order to 
make our procedure complete. 

1. Inclusions . Any time we find that two columns 
are in the following relationship F^ 3 Fj we may 
always replace F^ by Fj and discard F^. On the other 
hand, if the relationship is F^ D Fj then F^ may be 
discarded only if Fj or Fj • F^ (where F^ is any 
other column or intersection of columns) is used as 
an independent variable. Otherwise, we must synthe- 
size F^; this restriction would not hold if we 
allowed the use of inverters in addition to 2-stage 
diode logic. 


2 

Since there are nl ways of ordering n columns, 
and 2 ways in^which n columns may be complemented, 
there are ni*2 -1 codes which are symmetries of any 
n-bit code; some of these may be identical with the 
original code. 
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2. Intersections . Only those intersections 
which are codable are of interest to us when we are 
choosing the columns to be used as independent vari- 
ables. Non-codable intersections are only useful in 
the sense that they can be used sometimes to get a 
cheaper network when the columns which make up the 
intersections are to be expressed as f vine t ions of 
the independent variables. For instance, if we have 
two non-codable columns which have an intersection, 
it may sometimes prove cheaper to synthesize the 
intersection as one function rather than the two 
columns as two separate functions. For this reason, 
non-codable intersections are of interest only if 
they contain no primed columns. (Again, this last 
statement would not be true if we allowed the vise of 
inverters.) 

It sometimes happens that we find that the follow- 
ing (codable) intersections exist: F^ • F j and 
Fi • Fjj- (but Fj • Fij. * 0). In that case, we would 
use Fj and F-j_ • Fjj- tor F^ • Fj and Fjj.) rather than 
Fi • F j and Fi • Fjj., since in any case we cover all 
3 columns, but Fj has at least one more 0 than 
Fi * Fj, and it is in general desirable to have many 
0's in the columns used as the independent variables, 
because this gives us more freedom in “tailoring" 
the independent variables. 

It may occur that Fi • Fj = Fi • Fj • F^; this 
means that Fi • Fj C F^.. in such an instance we 
would usually consider the intersections as includ- 
ing Fjj;, namely as Fi • Fj • F^ rather than as just 
Fi • Fj. 

3. Validity . We have given above the necessary 
and sufficient condition for a code to be valid. It 
is quite simple to ascertain whether n columns make 
a valid code if there are no 0' s in these columns. 

If very many 0's do appear, however, it becomes more 
tedious to determine whether a code can be made 
valid. However, it is never necessary to resort to 
enumeration, since the problem of determining whether 
a code can be made valid is equivalent to the "mar- 
riage problem" and can be solved by the algorithms 
given in references 6, 7, and 8. 

4. Cost . Once a valid code covering as many of 
the columns of our original matrix as possible is 
found, it is possible to determine (from the number 
and type of columns which are not covered by this 
code) the minimum cost of realizing our circuit (in 
terns of either diodes or diode gates) . A more 
detailed discussion of this particular procedure is 
given in the Appendix. 

V. Summary of Procedure 

In this section we give a list of the steps which 
we execute in encoding a matrix. 

1. Discard all 1-columns and O-columns. 

2. Find all inclusion relations of the Fil> Fj 
type which exist among the columns remaining 
after Step 1. Discard all the inclu ding 
columns found in this manner. Call A the set 
of columns remaining now. 

3. Make a f list of all the inclusions of the type 
Ff D F j which exist in A and for which F j is 
codable. 

4. If it is desired to eliminate redundant rows 
from our original matrix, then examine the 


rows for such redundancies and eliminate 
them. The procedure here consists essen- 
tially of finding all inclusions among rows 
(of the type Ci o cj) , discarding all in- 
cluding rows, and then of finding all the 
intersections (which contain no primed rows) 
among the remaining rows and of combining the 
rows so as to get as small a number of row3 
as possible. When this is done, we may find 
that we need fewer bits than n to encode our 
matrix, since by eliminating rows we have 
decreased the value of m. If, however, the 
smaller number of bits is used in our code, 
we will often find that the cost of the final 
circuit will be higher than if we used the 
original values of m and n. 

5. In Set A, find all pairwise intersections 
which either contain no primed columns, or 
are codable (or both). 

6. Using the results of Step 5, form all the 
maximal intersections from Set A which again 
are either codable or contain no primes. 

7. Using the results of Step 6, form two sets: 

Set B will contain all the maximal codable 
intersections, as well as all the codable 
columns which are members of Set A but are 
not covered by any intersections included in 
B. 

Set C will contain all the maximal non- 
codable intersections found in Step 6 as well 
as all those non-codable columns in A which 
are not covered by intersections included in 
Set C. These two sets cover all the columns 
in A. 

8. From Set B, choose a set of n members. Call 
this set D.2 D is so chosen that all the 
columns of A which do not appear in D can be 
covered with a minimum number of members of 
Sets B and C. Set D is then modified using 
the results of the second paragraph of Section 
17-2 above, so that no column of A is coverid 
by more than one member of D. Should there be 
more than one possible choice of members of D, 
then we choose the members so as to get as 
many 0’s in D as possible. 

9. Check Set D for validity. If it is not valid, 
repeat Step 8 and choose the next best set for 
D. Repeat Steps 8 and 9 until a valid set is 
found. In executing Steps 8 and 9 we may be 
able to use some of the relations obtained in 
Step 3 above to eliminate or simplify some 
members of Sets B and D, as indicated in Sec- 
tion IV-1 above. 

10. At this point we can compute a minimum bound 
on idle circuit cost (see the Appendix). 

11. Once a valid set is found, we may have to re- 
place some of the 0's by l's or 0's to assure 
the validity of the set. The remaining 0's 
are replaced by l's or 0's so as to make easy 

5lf B has n members, then Sets B and D are 
identical. If B has less than n members, we add ar- 
bitrary columns to B. This is illustrated in Section 
VI below. If B has less than n members, and Set C is 
empty, this implies that there are redundant rows in 
our matrix. 
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the encoding of the columns which are not 
identified with the independent variables 
(but of course still preserving the validity 
of the code) . 

12. If the number of possible valid D sets is 
small, or if this procedure is carried out on 
a computer, we may want to try several of 
these sets in order to obtain a better solu- 
tion. The amount of enumeration here is 
usually quite small and fairly manageable, 
especially on a computer. 

13. Once a code is obtained, we encode all the 
members of Set C as well as those members of 
Set B not included in our final D set. There 
may be here more than one possible set of 
columns we may choose for encoding, as indi- 
cated in the first paragraph of Section IV-2. 
The encoding proper is done by standard min- 
imization methods for Boolean functions, and 
is well described in literature. (See, for 
instance, references 1 and 2.) 

VI . Illustrative Example 

The problem which we are treating here was first 
proposed by E. Hirschhom (Ref. 3). The procedure 
proposed by Hirschhom is somewhat intuitive and 
depends strongly on the designer's ingenuity. 
Hirschhom gives an example in that paper, and we 
will treat the same example in order to afford us 
some amount of comparison between the two methods. 

The matrix is shown in Figure 6. The value of n 
is set at the minimum, namely 5. Hirschhom finds 
that if a straightforward code is used (i.e., 00001 
is assigned to row 1, 00010 to row 2, 00011 to row 3, 
etc.), then 45 diodes and 16 gates, or 39 letter 
symbols (or contacts) are required to synthesize the 
ten control signals. Hirschhom 's procedure reduces 
these numbers to 24 diodes and 9 gates, or 25 letter 
symbols. Both we and Hirschhom design the circuit 
for each control signal separately, rather than de- 
signing one multiple output network for all the con- 
trol signals. 
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We will now treat this example by our method. 

1. Ft- is a 0-column; it is discarded. 

2. There are no inclusions. 

3. There are no inclusions. 

4. Hirschhom did not attempt to eliminate any 
rows. To afford a better comparison, we will 
do likewise. However, we should point out 
that the number of rows can be reduced down to 
seven, by making use of the following inter- 
sections which exist in the matrix: 

VVVV C 7‘VV C ll‘ C 12* C 13 ,C l6 

C 7* C 10* C 13* C 14* C 15 

c 17‘°18 

These are not the only intersections which occur in 
the matrix, but they are sufficient to show that the 
following transformation could be made: 


c 

a 



replaces c 


ii 


c 


1 

2 


c 

c 


c 

e 


C 3* C 

c 7 c 

°17* 


'19 

3 20 


4’V WV C 11* C 12’ C 16 
10 c 13 C 14 °15 
C 18 


Therefore n could be made as small as 3. 

5. Set A consists of all the columns except F_. 
The following pairwise intersections exist 5 
(we omit the letter F, and use the subscripts 
only): 


1 • 9' 


3 * 8* 

4 • 9' 

6 • 8, 6 • 9, 6 * 10* 

7 • 8, 7 • 9‘ , 7 * 10 

8 * 9, 8 * 9', 8 * 10 

9 * 10, 9 * 10* 


Columns 1 and 2 are not codable, since 2 
n « 5, and N, , *= 18 and N 


n-1 


16 for 

Np *s 18; therefore we 
discard the ifttersection 1 * U 9' , since it is not 
codable and contains a primed variable. 


6. The following intersections are maximal and 
codable (by the rules of section 11-11 above): 


3 • 8* 


4 * 9* 


6*8*9 

6 * 9 * 10* 

7 • 8 • 9 ' • 10 


7. Set B is identical to the list of intersec- 
tions immediately above. Set C contains columns 1 
and 2. 

8. We note that we can cover all of B with four 
columns. In fact, we can choose any of the following 


Figure 6 
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sets: 


a. 

3 • 8‘, 4 • 9', 6 * 10’, 7 

31 

b. 

3 * 8', 4 * 9', 6, 7 * 10 

31 

c. 

3 * 8', 4, 6 • 9 * 10', 7 

31 

d. 

3 * 8', 4, 6 • 9, 7 * 10 

31 

e. 

3 • 8', 4, 6 * 10', 7 * 9’ 

31 

f. 

3 • 8', 4, 6, 7 • 9' -10 

31 

e. 

3, 4 • 9*, 6 * 8, 7 • 10 

29 

h. 

3, 4 • 9', 6 • 10’, 7 * 8 

29 

i. 

3, 4 • 9’, 6, 7 • 8 • 10 

30 

j. 

3, 4, 6 • 8 • 9, 7 * 10 

29 

k. 

3, 4, 6 • 9 * 10\ 7 • 8 

29 

1. 

3, 4, 6 • 8, 7 • 9' • 10 

29 

m. 

3, 4, 6 * 9, 7 * 8 • 10 

30 

n. 

3, 4, 6 * 10’, 7 • 8 • 9' 

29 

0. 

3, 4, 6, 7 • 8 • 9' * 10 

30 


Next to each row, we have listed the sum of all 
the 0’s in the four columns. Thus we should 
first choose D from among the first six possi- 
bilities. At this point we can further narrow 
down our choice by considering the columns not 
covered by D. By examining column 1, we note 
that if the code words for rows 19 and 20 were 
to be adjacent (i.e. , if they differed in only 
one bit position) , then we could synthesize F^ 
with only one gate. Similarly column 2 shows us 
that we want row 1 and row 2 code words to be 
adjacent. As a result, we would like to avoid 
the following intersections in our code: 

4*9, 7*9', 7*9* * 10, 7*8.9' , and 7*8*9'*10 

because the columns resulting from these inter- 
sections differ in positions 1 and 2jor 19 and 
20. (This is also true of any intersection 
which covers column 3, but since we want to in- 
clude this column in our code, we cannot do any- 
thing about it.). Thus our first choice narrows 
down to two sets: 

* 3*8’, 4, 6»9*10' , 7 

and 3.8', 4, 6*9, 7*10 

9. Both of the above sets are valid, so we'll try 
the first one. Figure 7 shows these four col- 
umns and the fifth column (15) needed to make 
a 20-word code. Blanks are used instead of 
0's, and the underlined entries have been made 
in accordance with the considerations given 
above to facilitate the encoding of columns 1 
and 2. We must make sure that such replace- 
ments of 0's with 0's and l's do not make our 
code invalid. In our case the code is still 
valid. 

10. Since we have found a code which takes care of 
all but the first two columns, we conclude that 
the minimum cost for our final circuit will be 
at least two 2-diode gates, or 4 diodes and 2 
gates, or 11 letter symbols. (See Appendix for 
details of this particular step.) 

11. We now fill out the remainder of our code in 
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x 1= = 3.8* 

x 2 = 4 


6*9«10' 


v 7 


any arbitrary fashion, with the only restriction 
being that the code has to be kept valid, i.e., 
no two rows may be assigned the same code word. 

12. We may, if we wish, try one or more of the re- 
maining fourteen possible D sets, to determine 
whether a better code is possible. We will not do 
this, since, as we will see in the next step, our 
solution is quite good. 

13. Since we have used 20 of the 32 possible code 
words, the remaining 12 words become in effect 
"don't cares" and may be used to simplify the 
expressions for F^ and F2. Figure 8 shows the 
code filled out as per Step 11 above, and the re- 
sulting expressions for the 10 gate signals. We 
see that our solution requires 7 diodes and 2 
gates, or 14 letter-symbols. This is quite close 
to the minimum cost bound as determined above, and 
considerably better than the solution proposed by 
Hirschhom, which requires 24 diodes and 9 gates, 
or 25 letter symbols. 

We should note that while performing Step 8, we 
could have also observed that since with six columns 
we could have covered all tiie members of Set A, and 
that if we were to use six bits, all the columns of 
Set A would be codable, so that it is conceivable 
that with n = 6 we could have obtained an ideal solu- 
tion, provided we could have made a valid code. In 
our case this is not possible for n * 6, because if 
we use columns 1, 2, and 3 (or 3*8') as three of the 
six code columns (and we must do this to obtain an 
ideal solution), then we note that rows 3 through 14 
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would be Identical in these 3 columns , so that there 
would only be eight possible code words that could 
be inserted into these twelve positions. If we were 
willing, however, to use n = 7, then an ideal solu- 
tion would be possible. 

This type of consideration is quite valuable in 
determining, for a particular problem, what the 
"exchange ratio" is of cost versus code bits (in 
excess of the minimum number of code bits). 


tion might in fact greatly enhance the usefulness of 
our method by allowing us to try several of the pos- 
sible codes as determined by our procedure (see Sec. 
IV, Step 12) . 

Finally, we should point out that this problem 
has also been treated from a somewhat different point 
of view by E. J. Schubert (Ref. 4) . 

VIII . Conclusion 




x 2 + *3 + x 4 + *5 
x 2 ' x 4 * x 5 
X 1 

X 2 

0 


*3 

x 4 

x i 

*3 

i' 


Figure 8 


VII. Discussion 

In order to make this presentation more complete, 
we will mention here a few considerations pertinent 
to the use of our encoding method. 

First of all, we should realize that while we dis- 
cussed the procedure in terms of design of central 
control of a digital computer, this is by no means 
the only application to which our method can be put. 
Another possible use it has is in the synthesis of 
the output networks of a sequential switching cir- 
cuit. In fact, we can use this procedure whenever 
we need to simultaneously encode any number of 
Boolean functions with some number of variables. 

As is true for most procedures of this type, our 
method will be better suited to some problems than 
to others. In particular, it lends itself better to 
encoding matrices which contain many "don't cares" 
than to completely specified ones. Since the method 
does not try all the possible assignments (there are 
1.4 x 1023 possible distinct assignments for a 20- 
word code which uses 5 bits — see ref. 5) , we are 
not guaranteed that our solution will be as good and 
cheap as possible, except in the cases where we meet 
our lower bound on the cost; in most cases, however, 
we do come quite close to this bound, and thus get 
quite a good solution. It is also true that this 
bound cannot always be met, but it does give us an 
idea of how good our solution is. 

The main advantage of our procedure is that it is 
quite systematic, and lends itself well to being 
programmed on a digital computer. Such a mechaniza- 


A method has been presented for simultaneously 
encoding an arbitrary number of Boolean functions in 
an economical manner. Included is a means for esti- 
mating a lower bound on the circuit cost, so that it 
is possible to obtain some indication of how good a 
particular encoding is. The entire procedure is 
quite systematic, and can be mechanized on a digital 
computer. The application of this procedure to the 
synthesis of gating signals in the central control of 
a digital computer is given, and other applications 
are indicated. The advantages and limitations of the 
method are discussed. 


Appendix 

We have defined above the cost of a circuit as 
being the number of diodes needed to construct that 
circuit in a 2-stage form. We will give here a 
method for estimating the lower bound on the cost of 
a circuit in terms of both the diodes and the gates 
needed to realize the circuit in 2-stage form. The 
reason for including the cost in terms of gates is 
that in some realizations (for instance, when resis- 
tor-transistor logic is used) the number of gates or 
packages is of more interest than the number of 
diodes. 

Let us define P^ and Pg as the minimum costs in 
terms of diodes and gates, respectively, of our cir- 
cuit. We determine these values in the following 
fashion: 


1 . 


2 . 


Consider Sets B and B, as defined in the text 
above . Let b be the minimum number of columns 
needed to cover all those columns in B which 
are not covered by the largest valid D set . 
thenTT is obvious that we will need at least 
b gates and 2b diodes to realize the signals 
corresponding to all the columns in Set B. 
Consider a column, F^ , which is not codable. 

We can easily count the Ni 0 and % ^ for this 
column, and 


N, v = 2 - N. - N _ 

i,P i,o M 


Let N. stand for either N. or N. . , 
the folJowing relation holds? 0 ' 


< 2 s < N 

,x ~ “ i»x 


+ N 


i,0 


If 

( 1 ) 


where s is an integer, 0 < s< n, then this 
means that it might be possible to make N. = 

2 by replacing the proper number of 0's 1X 
by l's (or O's). If this is possible and is 
in fact done, then F^ can be realized with a 
single gate and n-s diodes. Thus we want s as 
large as possible. As an example, consider F^ 
in our illustrative example: 
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N. - 2 N, .= 18 N. 12 n = 5 
i,o i,l i ,|T 

2 < 2 s < H 


Thus s can be 1 , 2, or 3. If we choose so3, 
then conceivably we could synthesize F]_ with 
one gate and n - s = 2 diodes. If relation 
(l) above does not hold, but the following 
relation holds, 


N. < 2 s 
i,x- 


+ 2 < N. 

— l.X 


+ N 




( 2 ) 


where t is an integer, 0 < t < s, then the 
number of l's (or O's) in F* can be made 
equal to 2 s + 2 , by substituting the proper 
number of l's and O's for the ff's in F^. If 
this is done, then the following possibilities 
exist: 

a) If 3 = n-1, then we can synthesize F^ 
with 2 gates and n - t + 1 diodes. As an 
example, consider a case where Ny_ = 17, n=5. 
We see that s = 4 (= n - l) , and t = 0. The 
function f = xj + X£ xj *4 x^ is seen to con- 
tain 17 points, and can be realized with 2 
gates and n-t+l= 5 - 0 +l =6 diodes. 

b) If s < n - 1 , and t = s - 1, then we 
can realize Fi with 2 gates and n - s + 2 
diodes. For instance, if % 1 = 12, n = 5, 
then s = 3 , t = s - 1 = 2 , and the function 
f = x^X 2 + z i ^3 = xy (xp+ X 3 ) can be realized 
with 2 gates and n-s+2=5-3+2=4 
diodes. 

c) If s< n - 1, t < s - 1, 3 gates are 
needed and 2n - s - t + 1 diodes. Let, for 
an example, Njji = 10 , n = 5 ; then s = 3 , 

t = 1. The function f = X]X 2 + X 1 X 3 X 4 con “ 
tains 10 points. It can be realized with 3 
gates and 

2n-s-t + l = 10-3-l + l = 7 diodes . 5 


The formulas given above are fairly easy to 
derive. However, it becomes quite difficult 
to generalize this procedure to the cases 
where Fi doesn't satisfy either relation (l) 
or relation ( 2 ) above. At present, it appears 
best to set the lower bound in such cases to 
3 gates and 6 diodes (since each gate has at 
least 2 inputs) . 

3. If we now consider Set C as defined in the 
text, we can obtain the minimum bound on the 
cost of covering all the columns in C by con- 
sidering all the possible combinations of 
columns and/or intersections which cover all 
the columns in C, getting the bound on each 
set, and choosing the lowest of these bounds. 
Let this lowest bound be B 4 and R„, for diodes 
and gates, respectively. For instance, if a 
Set C consisted of F^ • F 2 , F 2 • F 3 , and F 4 , 
then we would have to compute the following 
sums, where Q(i) stands for the cost of F^ as 
determined by Step 2 above: 

a. Q(1.2) + Q(3) + Q(4) 

b. Q(l) + Q(2*3) + Q(4) 

c. Q(l) + Q( 2 ) + Q(3) + Q(4) 

This is done both for the diodes and gates. 

We then pick the lowest of these sums as 
and Rg respectively. 



We must note here that the minimum cost bounds 
have been computed on the assumption that the 
networks which realize the various signals res- 
resented by the columns in B and C sets are syn- 
thesized separately for each such column. This 
is often the case in many applications, since it 
makes the design and trouble-shooting easier (a 
fault in any part of such a network affects only 
one gating signal). It is, however, possible 
that in certain cases a saving could be obtained 
by designing only one multiple output network for 
all the gating signals. It must of course be 
realized that the minimum cost bound as computed 
by the method given above is not necessarily 
realizable so that in some cases there may exist 
no solution which meets that bound. However, 
this bound does give a fair indication of how 
good any given solution is, and in those cases 
where it is met, it guarantees that we have in- 
deed an optimum solution. 


Part of the research presented in this paper was 
done while both authors were employed by the Bell 
Telephone Laboratories in Whippany, N. J,, and the 
remainder of the research was supported by the 
Higgins Funds at the Department of Electrical Engi- 
neering of Princeton University. 
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Summary 

A novel Approach to arithmetic operations 
in digital computers is described which 
requires fewer stages of logic and fewer com- 
ponents than conventional adder circuitry. 

This is accomplished by using the decimal 
digits in an arithmetic field to complete 
memory addresses which in turn are used to 
reference a set of answers prestored in mag- 
netic core storage. Multiple non-contiguous 
memory addresses stored in the machine 
hardware then reference the various digit 
positions of the sum, difference, or partial 
product at the proper time in the arithmetic 
cycle to develop the result. The accuracy of 
all results is easily verified by simply per- 
forming an automatic validity check on the 
data paths within the computer system. 

The machine operator need not concern 
himself with the "hardware" method used 
within the computer, since the arithmetic 
commands commonly found in digital com- 
puters(Add, Subtract, etc.) are available in the 
instruction set for the system shown; table 
lookup circuitry does the work. 

Introduction 

Arithmetic operations (addition, subtrac- 
tion, and multiplication) in the computer 
system described in this paper are performed 
without the use of conventional adder cir- 
cuitry as found in most stored program com- 
puters. This is done by making reference at 
the proper time in the arithmetic cycle to a 
set of answers prestored in an assigned table 
area in magnetic core storage. Using this 
method, arithmetic results are obtained with 
considerably fewer components and fewer 
stages of logic than are found in conventional 
adder circuitry. Furthermore, a completely 


checked result is obtained, since a sum, dif- 
ference, or product is created solely by 
routing the proper data onto the data paths of 
the machine; here it can undergo a parity 
check, character by character. 

Before a detailed description of arithmetic 
operations can be given, it will be necessary 
to review a few design features of this system 
which make table lookup arithmetic feasible. 

A partial data flow schematic is shown in 
Figure 1. (For reasons of clarity, only 
required components are included; input/output, 
timing, and control are implied. ) 

The main storage medium is a three- 
dimensional magnetic core array, 12 planes 
deep and containing 20, 000 decimal digits. It 
is addressed serially by digit from a single 
transistorized 5-digit Memory Address Regis- 
ter (MAR). The array is driven from two 
10x10 matrix switches which select one 
"column" of 1 2 cores to be read from memory 
to the transistorized Memory Buffer Register 
(MBR). Since digits in this system consist 
of 6 bits (4 data bits, 1 parity check bit, and 
1 flag bit), the state of these 12 cores read 
from memory by one read comipand is placed 
in MBR as two digits. (If the digit at memory 
location XXXXN, where N is even, is addressed 
.for read-out, the digit at memory location 
XXXXN + 1 will be read out as well; if the 
digit at memory location XXXXN + 1 is 
addressed for read-out, the digit at memory 
location XXXXN will be read out as well. ) 

This two-sequential-digit output from memory 
using only one address is of great importence 
in multiplication. Its application will be seen 
later. When the two digits have beer, 
received by the MBR, further selection of the 
digit is made, based on the units position of 
the address in MAR, and it is ascertained 
which of these digits is to be sent to the one- 
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digit transistorized Memory Data Register 
(MDR). The digit may then be sent from the 
MDR to other machine locations. 

Three -hundred- twenty memory locations 
are assigned as a ’’table area": a 20-digit 
area from memory locations 00080 through 

00099 to receive the product in multiplication; 
a 100-digit area from memory locations 00300 
through 00399 for storage of an addition table 
for use by the computer during the execution 
of all arithmetic instructions (Figure 3); and 

a 200-digit area from memory locations 

00100 through 00299 for storage of a multipli- 
cation table, for use by the computer during 
the execution of the multiply instruction 
(Figure 7). All of these "table area" locations 
are in addressable memory. 

Table lookup arithmetic requires that mul- 
tiple noncontiguous memory addresses be avail- 
able during the operation so that factors can be 
obtained and answers can be written back into 
memory at the various digit positions of the par- 
tial product (or sum or difference) at the proper 
times. This is accomplished through the use of 
a two-dimensional core array called Memory 
Address Register Storage (MARS, Figure 1) 
which supplies the Memory Address Register 
(MAR) with the proper address required in each 
machine cycle. The eight 5-digit registers in 
MARS which are available for MAR to use in 
addressing memory are Instruction Address 
Registers 1 and 2 (IR-1 and IR-2), Operand 
Address Registers 1, 2, and 3(OR-l, OR-2, 
and OR-3), and Product Address Registers 1, 2, 
and 3 (PR-1, PR-2, and PR-3). Their use will 
be discussed later. In any one machine cycle, 
MAR is controlled to accept an address from 
one of the MARS registers. During the machine 
cycle, the 5-digit address in MAR is returned 
to a MARS register or registers by way of the 
Increment Switch. In passing through the incre- 
ment switch the address from MAR may be 
incremented by 1 or 2, or decremented by 1, or 
may bypass the switch and remain unchanged. 

The MARS register to which the MAR address 
is returned may be the MARS register from 
which the address originally came, or a dif- 
ferent MARS register, or the original and a dif- 
ferent MARS register. The control for these 
operations is determined by the operation to be 
performed. 

The remaining hardware required for des- 
cribing table lookup arithmetic consists of a one- 
digit Multiplier Register, a two-digit Digit 
Register, two translators called True/Comple- 
ment Switches 1 and 2, two Plus 1 Switches, and 
a Doubler. The registers serve as temporary 


storage devices for digits; the true /complement 
switches provide either the true numeric value 
of a digit supplied to them or its nines comple- 
ment; the plus 1 switches increment by one the 
output of true /complement switch 1 and/or the 
tens digit of the doubler output; and the doubler 
does just what its name indicates, i. e. , it 
supplies a two-digit output twice the value of its 
one-digit input. Table lookup arithmetic does 
not require the use of counters, shift registers, 
word register s , accumulator registers, or 
adders. 

Along with the above "hardware" descrip- 
tion, it should be noted that the instruction set 
of this computer system consists of two- 
address, 12-digit instructions, so that the 
locations of both fields involved in an arithme- 
tic operation are specified in one instruction. 
During instruction interpretation time (I-time), 
these two 5-digit addresses are loaded into 
OR-2 and OR-1 for use during instruction 
execution time (E-time). All arithmetic opera- 
tions are performed serially, one digit at a 
time, on data words (fields) which are variable 
in length. Each field is defined by placing a 
flag bit over the high order digit of the field. 

The units digit of a field may also have a flag 
bit, but this is used to denote that the field is 
negative; if the flag on the units digit is missing, 
the field is considered positive. ( Note : Since 
flags have this dual function in arithmetic 
fields, the minimum length of a field is two 
digits. No restriction exists on the maximum 
length, except what is practical for the stored 
program being executed. ) 

Addition 

Let us now follow the logic of the addition 
of two positive fields as it would occur in a 
typical Add instruction. The general approach 
taken by logical circuitry within the machine 
is the use of corresponding digits of the addend 
and augend to form a unique address in core 
storage at which is stored the sum of the two 
digits selected. This sum then replaces the 
augend digit in memory, and the procedure is 
repeated using the next higher order digits of 
the two fields. 

Assume that I-time for the instruction has 
been completed so that the 5-digit memory 
addresses of the addend and augend have been 
stored in MARS registers OR-1 and OR-2, 
respectively. The address of the augend stored 
in OR-2 has also been stored in OR-3, should 
recomplement be required. In Figure 1, it can 
be seen that in the first machine cycle the 
address contained in OR-1 is read out of MARS 
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to MAR. Using this MAR address, the digit in 
the units position of the addend is read from 
memory to MBR and thence to MDR. From 
here it is placed on the data bus and stored 
temporarily in the digit register (units). 

Simultaneously, the address in MAR is passed 
through the increment switch, decremented by 
1, and replaced in OR-1 as 09174 (i. e. , the 
address of the tens digit of the addend). 

At the beginning of the second machine 
cycle the address contained in OR- 2 is read 
out to MAR. The digit at this address (the 
units position of the augend) is then read from 
memory to tne MBR and then the MDR. The 
address from MAR bypasses the increment 
switch and is replaced unchanged in OR- 2. 

After memory has been read in this cycle 
(shown in Figure 2), MAR is reset and the 
augend digit from MDR passes through true/ 
complement (T/C) switch 2. The true output 
of T/C switch 2 is stored in the tens position 
of MAR. Simultaneously, the addend digit 
previously stored in the digit register (units) 
is passed through T/C switch 1 and the true out- 
put, bypassing the Plus 1 Switch, is placed in 
the units position of MAR. The remaining 
positions of MAR are filled with fixed digits; 
zeros are forced into the ten-thousands and 
thousands positions, and a "3” is forced into 
the hundreds position. 

This address is then used to read a digit 
from the addition table stored in memory 
(solid coordinates in Figure 3) to the MBR and 
then the MDR during the third machine cycle 
(Figure 4). This is the units digit of the sum. 

The fourth machine cycle finds OR- 2 again 
read out of MARS to MAR. The augend digit 
at the corresponding memory location is 
replaced by the units digit of the sum which was 
stored in MDR during the previous machine 
cycle. The address from MAR simultaneously 
passes through the increment switch, is decre- 
mented by 1, and is replaced in OR- 2 as 14623 
(i. e. , the address of the tens digit of the 
augend). One add cycle has now been completed 
and one digit of the sum developed. 

The procedure is now repeated in the next 
four machine cycles, developing the sum of the 
tens digit of the addend and augend (the addresses 
of these digits are now contained by OR-1 and 
OR- 2, respectively). A carry occurs using 
this pair of digits which is detected and handled 
in the following manner: After the proper 
digits of the addend and augend are routed to 
MAR, the manufactured address is used to 
read the second digit of the sum (flag bit 


included) from the addition table in memory 
(dashed coordinates in Figure 3) to MDR. The 
flag bit stored with this digit denotes a carry 
resulting from the addition of the two digits. 
(Detection of this flag on read-out sets a logi- 
cal path to provide for the inclusion of the 
carry when the hundreds digits are added. ) 

The sum digit from MDR (without the flag) is 
stored back in memory per OR- 2, replacing 
the tens digit of the augend. At the end of the 
second add cycle, the MARS addresses are: 


MAR 14623 

MARS OR-1 09173 

MARS OR- 2 14622 

MARS OR-3 14624 


The last digit of the sum is developed in 
the same manner (Figure 5), except that the flag 
detection from the previous add cycle causes 
the output of the digit register (units) to pass 
through T/C switch 1 and the plus 1 switch to 
compensate for the previous carry. The 
address in MAR used to obtain the sum digit 
is, therefore, increased by one over that 
which would normally be used. The aum digit 
is then placed back in memory, replacing the 
last digit of the augend. The fact that both 
fields contain a flag bit in the digit position 
used in this add cycle terminates the add 
operation, and the computer starts the I-time 
of the next instruction in the sequence. 

Addition of two fields requires 4A machine 
cycles where A equals the number of digits in 
the augend. 

Addition of two fields with unlike signs and 
subtraction are carried out by complement 
addition, but the procedure is only slightly 
changed from that previously described. If, 
for example, the augend field had been nega- 
tive in developing the previous sum, the out- 
put of T/C switch 2 would have been the nines 
complement of its input. The table lookup is 
then performed as before, using the true value 
of the addend and the complemented value of 
the augend for the table lookup address. 

( Note: To get the tens complement of the field, 
a carry in is assumed on the first add cycle 
which is implemented by having the plus 1 
switch increment the output of T/C switch 1 
by one before the digit gets to MAR, ) 

Recomplement is performed whenever the 
signs of the augend and addend are initially 
different and the sign of the sum is negative. 
After the sum is completed and it is 
determined that recomplement is required, 
the reset address of the units digit of the sum 
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is transferred from OR-3 to OR- 2 and another 
pass is made through the sum, complementing 
each digit. This, too, is performed using the 
table lookup procedure. Each digit of the sum 
passes through T/C switch 2 whose output, in 
this case, is the complement of the sum digit 
input. This output digit is routed to the tens 
position of MAR and a zero is forced in the 
units position; the table lookup performed 
using this address causes memory to yield the 
complement of the original sum digit to MDR. 

It is then placed back in memory in the proper 
sum digit position. The procedure is repeated 
for each of the sum digits until the recomple- 
mented field is completely developed. Recom- 
plement requires four machine cycles per 
digit of the sum. 

Multiplication 

Multiplication utilizes six of the eight 
MARS registers to store addresses and makes 
reference to a multiplication table as well as 
the addition table. The product is built up 
serially in much the same manner as a third- 
grader performs his multiplication. Logical 
circuitry within the machine sequences the use 
of single digits of the multiplier and multipli- 
cand to form unique core storage addresses, 
at which are stored the partial products. 

After a partial product is added (through table 
lookup) to the product area, the logic deter- 
mines the next pair of multiplier and multi- 
plicand digits to be used. The process con- 
tinues building up the product in the product 
area until each digit of the multiplier has been 
used with each digit of the multiplicand and the 
product has been developed. 

During I-time in the Multiply instruction, 
OR-1 is loaded with the address of the multi- 
plier, and OR- 2 and OR-3 are loaded with the 
address of the multiplicand. The product area 
in memory (locations 00080 through 00099) is 
cleared to zeros, and address 00099, the units 
digit of the product, is placed in PR-1. 

During the first E-time cycle , the units 
digit of the multiplier is read from memory 
per OR-1 to the multiplier register, and OR-1 
is reloaded with the address of the tens digit 
of the multiplier; it remains inactive until each 
digit of the multiplicand has been used. 

On the second machine cycle, the units 
digit of the multiplicand is read from memory 
per OR-3 to MDR and thence through T/C 
switch 2 to the tens position of MAR. OR-3 is 
simultaneously decremented by one. Figure 


6 shows the status of MARS at this point. 

During this same cycle, the digit from the mul- 
tiplier register is doubled and sent to the 
hundreds and units positions of MAR; the 
hundreds position is increased by one before 
reaching MAR by passing through the plus 1 
switch. 

The manufactured address in MAR is then 
used (third cycle) to read from memory to 
MBR-Even and MBR-Odd the two-digit product 
of 6 and 9 (solid coordinates in Figure 7). 

J^Note: The advantage of having two sequen- 
tial digits read from memory for each 
address used is obvious here. It should 
also be noted that this product is stored in 
memory with the digits transposed. Since 
the multiplier digit is always doubled, the 
address in MARwill always be even, per- 
mitting the addressed digit to pass to 
MBR-Even, to MDR, and thence to the 
units position of the digit register. The 
digit from MBR-Odd is transmitted on a 
separate data bus to the digit register 
(tens), where the product then appears in 
proper sequenceTJ 

PR-1 is read out to MAR and the units 
digit of the product area (initially zero) is 
transferred to MDR during the fourth cycle. 

In parallel the MAR address is placed unchanged 
in PR- 2, and also into PR-1 decremented by 
one (Figure 8). (PR-1 will not be used again 

until all digits of the multiplicand have been 
used. It functions then as a reset address for 
the proper positioning in the product area. 

Its function will be demonstrated later. ) The 
digit in MDR and the product units digit from 
the digit register (units) are then used to 
obtain a sum address in the usual way. 

On the fifth cycle the manufactured 
address in MAR is used to read the sum of 0 
and 4 from the addition table in memory to 
MDR, as described earlier. 

On the sixth cycle PR- 2 is read out to 
MAR, then decremented by one and placed 
back into PR- 2 and PR-3 simultaneously. In 
parallel, the new partial product low order 
digit in MDR is read back into memory per the 
MAR address. 

PR- 3 is read to MAR and placed back in 
PR- 3 unchanged on the seventh cycle, so that 
the tens digit of the product area can be read 
from memory to MDR and routed to MAR Tens 
to help create a sum address as before (Figure 
9). The product tens digit from the digit 
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register (tens) completes the variable portion 
of the address. 

Reading from memory using this address 
in the eighth cycle obtains the sum of 0 and 5 
from the addition table to MDR. 

The last cycle (ninth) in this sequence 
causes the digit in MDR to be placed back in 
memory using the address from PR-3. The 
two-digit product of the units digits of the 
multiplier and multiplicand is now located in 
the proper position in the product area. 

^ Note : The interaction of PR-2 and PR-3 is 
not obvious from this example. Each time a 
two-digit product of a multiplier digit and 
multiplicand digit is created, it is necessary 
to add these two digits serially to the partial 
product previously developed. PR-2 is used 
to add the units digit to the partial product, 
and it then steps to the address of the tens 
digit. Any carries which may occur from 
the addition of the units digits can be propa- 
gated by increasing the address used in the 
table lookup of the addition of the tens digits. 
Carries resulting from the addition of the 
tens digits, however, may have to be propa- 
gated several times before the next sequence 
is performed. PR-3 is used to accomplish 
this when necessary (not required in this 
example) and PR-2 holds its place as a 
reset address.] 

The tens digit of the multiplicand is obtained 
on the tenth cycle, using the address from 
OR-3 which is read to MAR, decremented by 
one, and placed back in OR-3. * The tens 
digit travels from MDR to MAR Tens as before, 
and simultaneously the output of the multiplier 
register and doubler is placed in MAR Hun- 
dreds and MAR Units, completing the address 
(Figure 10). The flag bit on the multiplicand 
digit is sensed on this cycle, and a flip-flop 
is set to start the procedure over again with 
the units digit of the multiplicand when this 
sequence is completed. 


* For purposes of clarity in explaining the 
example chosen, development of the next par- 
tial product using the tens digit of the multi- 
plicand is shown as additional unique cycles. 
Actually, the logic of the multiply timing ring 
would cause a return to the second cycle above 
except that (1) the addresses now in MARS 
would be utilized, and (2) PR-2 would be used 
in the fourth cycle instead of PR-1. 


Again, reference is made to the multipli- 
cation table in memory (eleventh cycle) using 
this manufactured address (dashed coordinates 
in Figure 7). The two-digit product of 4 and 9, 
with digits transposed, is read to MBR-Even 
and MBR-Odd and thence to the digit register 
as previously described. 

PR-2 is used in the twelfth cycle to obtain 
the previously developed partial product tens 
digit from memory. This digit and the product 
units digit are used to form a sum address as 
before (Figure 11). 

An add cycle (thirteenth) is again carried 
out, using this manufactured address to refer 
to the add table. The one -digit sum is read to 
MDR from memory. Since the sum of the two 
digits is greater than 9» a flag (carry indication) 
is detected which sets up the gating to include 
the carry on the next add cycle. 

Using the address in PR- 2, the new partial 
product tens digit (without its flag) is read back 
into memory (fourteenth cycle). In parallel with 
the reading of the digit back into memory, the 
address is returned to PR- 2 and PR- 3, decre- 
mented by one. At the end of this cycle, the 
MARS addresses are: 


OR-1 

00422 

OR- 2 

12047 

OR-3 

12045 

PR-1 

00098 

PR- 2 

00097 

PR- 3 

00097 


It is then necessary to read out the hun- 
dreds digit of the partial product (fifteenth cycle); 
PR- 3 is used. It is placed in MAR and replaced 
unchanged in PR- 3. This digit and the product 
tens digit, from the digit register (tens), form 
a new sum address except that gating through 
the plus 1 switch provides the effect of the carry 
from the previous add cycle. 

The sixteenth cycle of this sequence is then 
executed, using the manufactured address in 
MAR (00304) with memory yielding the sum 
digit to MDR from the addition table. 

On the last cycle of this sequence, PR- 3 
is used to place the new hundreds digit of the 
partial product back in memory. The partial 
product is now complete, using both digits of 
the multiplicand and the units digit of the 
multiplier. 

We now return to the first cycle, where a 
new multiplier digit is required and OR-1 is 
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used to obtain it from memory (Figure 12). 

The flag is detected on the digit, which sets 
flip-flops to terminate the execution of the 
instruction when the flag is next detected in the 
multiplicand. This new digit is stored in the 
multiplier register, and OR-3 is reset equal 
to the content of OR- 2, providing the return 
path to the units digit of the multiplicand. 

From this point on the multiplication 
proceeds, developing the remaining digits of 
the product in a manner similar to that pre- 
viously described. It should be noted that 
PR-1 contains the proper address at which the 
next digit of the partial product will be 
developed. PR-2, therefore, will be reset to 
this address at the proper time and PR-1 
decremented by one so that a new reset 
address is then available. It will not be needed 
in the development of this product, however, 
since the complete product will be developed 
before it is again required. 

On the last memory cycle in the develop- 
ment of the complete product, a flag bit is 
placed over the high order digit of the product. 
No restriction exists on the maximum length 
of the product except that only 20 digits of the 
product area are cleared to zero in the execu- 
tion of the multiply instruction. Should the 
product exceed 20 digits, additional program- 
ming steps are required to clear to zero the 
appropriate number of digits in excess of 20 
before multiplying. 

Multiplication of two fields requires 
approximately Q(8. 4P + 2) machine cycles, 
where Q and P equal the number of digits in the 
multiplier and multiplicand, respectively. 


Conclusion 

The system presented above describes 
the arithmetic unit incorporated in the first 
engineering model of the IBM 1620 Data 
Processing System. A refinement of this 
table lookup arithmetic is included in the pro- 
duction version of the 1620. No division 
hardware was provided because the use ratio 
of such an instruction to others was found to 
be small. Since other features of the 1620 
provided convenient subroutine programming, 
division was included as part of the general 
program package of mathematical subroutines 
(sin x, log x, e x , etc. ) included with the 
system. 

While table lookup arithmetic is employed 
in the 1620, the arithmetic operations described 
above are applicable to any system having the 
necessary organization. It is important to note 
that the speed of the arithmetic is directly 
proportional to the machine cycle. Also, the 
machine organization described in this paper 
lends itself to a table lookup divide instruction 
using the existing components; only economic 
justification and the divide timing ring are 
required. 

Elimination of the expensive adder cir- 
cuitry heretofore required was made practical 
by using many of the components needed by the 
system for other operations. The low cost, 
high performance core storage, for example, 
provided the table area and the speed required 
for efficient arithmetic operations. The MARS 
array and registers, *too, were required for 
other uses by the instruction set. That being 
the case, a few extra components and a logical 
sequence provided the same final result as a 
conventional adder, but at a reduced cost. 
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ADDITION TABLE (FOR USE IN EXECUTION OF ALL ARITHMETIC INSTRUCTIONS) 
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UNITS DIGIT TIMES MULTIPLICAND UNITS DIGIT) 
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MULTIPLICATION TABLE ( FOR USE IN EXECUTION OF MULTIPLY INSTRUCTION) 
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Figure 11 

DATA FLOW - SUM ADDRESS GENERATION 
(PRODUCT UNITS DIGIT PLUS PREVIOUS PARTIAL PRODUCT TENS DIGIT) 
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ON MICROELECTRONIC COMPONENTS , INTERCONNECTIONS , 
AND SYSTEM FABRICATION 

Kenneth R. Shoulders 
Stanford Research Institute 
Menlo Park, California 


Summary - -Microelectronic data processing systems 
are analyzed and various requirements for compo- 
nents are considered. The rapid reduction in 
transmission line cross section upon scaling down 
causes increased losses in microelectronic systems 
thus giving rise to the need for high impedance 
components for non-cryogenic applications. A new 
component is proposed that seems particularly 
suited for microelectronic system requirements and 
fabrication methods. This component is based upon 
the quantum mechanical tunneling of electrons into 
vacuum; has an estimated switching time of 10 4 9 
seconds ; promises immunity to temperature varia- 
tions; and seems adaptable to self-forming manu- 
facturing methods giving component uniformity. A 
method of electron beam activitated micromachining 
for film materials is presented in which a thin 
chemically resistant film is formed with an elec- 
tron beam to selectively protect the film being 
machined during a subsequent molecular beam etch- 
ing. This high speed process has resolution capa- 
bilities of several hundred angstrom units; can 
process electronically clean materials with mini- 
mum contamination; and may ultimately be suited 
for the economical production of one cubic inch 
data processing systems having 10^ active compo- 
nents. 


The Over-All Requirement 

We want to build electronic data processing 
systems that have the complexity of human neural’ 
networks but are capable of operating with elec- 
tronic speed. These machines should be capable of 
solving very complex problems such as those that 
arise in nonlinear systems, magnetohydrodynamics, 
pattern recognition from the real world, self- 
organization and learning, and should in general, 
be very useful assistants in our society. 

Having once obtained such a machine, the need 
will invariably arise to make it highly portable 
so that it can cope with problems where they exist 
instead of depending on problems being brought to 
it. Everyone should have such a personal assist- 
ant. 

The realization of such a machine requires an 
extremely high degree of organization of matter 
and may not be economically permissible unless a 
high speed electronic construction process is used 
for specifying the end product. Admittedly, this 
is not the next generation of equipment to be ex- 
pected. Our aims here are to try to achieve what 
seems possible with electronics and not base our 
work upon an incremental advance from present de- 
vices. 

There has been a tendency lately to conceal 
the need for high resolution processing methods 


that are expected to work, 
trarily 10 components for our system. 


through the use of such terms as "molectronics, " 
or by talk of distributing this-or-that . There 
always seems to be a discrete device, even if com- 
posed of distributed parts, which serves our re- 
quirements in a data processing machine. It seems 
likely that this will continue for some time. 

Our problem then is how to handle material 
with enough resolution to fabricate 10 11 devices 
in a size that is portable and then to find an in- 
teresting configuration for these materials so as 
to give a desired electronic result. 

But before we start building things at random, 
we might do well to look at some of the properties 
of electrical devices in the micro-world, and thus 
avoid some lost motion by only working on things 

We have picked arbi- 
Let us 

see what this implies in terms of physical size 
for the machine. 

The most highly resolved construction process 
that we have any control over is what we call 
electron beam activated micromachining. This pro- 
cess has been shown to give some results in the 
100-angstrom-unit region, but let us degrade this 
to 300 angstroms for the moment and see what we 
can make with it. If an arbitrary component can 
be made from about 1000 of these 300 angstrom unit 
dots, as seen from the plan view, then the outside 
dimensions of the component would be about one mi- 
cron. The thickness could be expected to be an 
equivalent value. If three of these one-micron 
sized areas are used for interconnections for 
every component, then we have two microns of 
linear dimension per component and 10 4 per inch. 
One square inch of area would contain 10° elements. 
If ten layers of components per module were used 
there would be 10 9 components. One hundred modules, 
each being 0.010 inch thick for support purposes, 
would stack to a one- inch cube having 10 1 compo- 
nents . 

Before we go into the insides of this system, 
let us look at some external problems. If we wish 
to pick up optical data we find that a one-inch 
cube made by stacking wafers has just two surfaces 
available for communication with optical patterns 
from the outside world. One of these would be ex- 
pected to receive light images, and the other to 
generate light patterns for human use. 

There will be a noticeable lack of lead wires 
at the edges of the modules, compared to the num- 
ber of components on the module. For communicat- 
ing between modules, wide-band serial techniques 
could be used for the few wires available at the 
edge and the surfaces could be used by allowing 
light coupling between modules. It might be ex- 
pected that up to 10 4 parallel paths could be 
achieved in this way without the use of a lens. 

The bandwidth would seem to depend upon the light 
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detector, and may fall in the 10-megacycle range. 

In changing the size of things so radically — 
about three orders of magnitude — we will be well 
advised to look over old components to see if they 
can become useful in this size range. For example, 
we may reconsider a mechanical electrostatic relay 
because now it could have vacuum contacts and 
operate at low voltages in the 10-megacycle range. 
We could look again at conventional vacuum tubes 
and consider the negligible transit time lag and 
absence of bothersome space charge effects because 
of the high fields. Thermal devices will be found 
to have time constants in the microsecond range. 

Scaling of Electrical Properties 


Not all effects are beneficial when sizes are 
reduced. One of the most disturbing changes is 
the increased electrical loss of conventional 
wires . 

The effect of scaling down in size on certain 
fundamental electronic components can be shown by 
reviewing the behavior of inductance, capacity, 
and resistance. The values of inductance and 
capacity vary directly with their size, while re- 
sistance has an inverse behavior. If all dimen- 
sions of a standard sized tuned circuit were 
reduced by 10 4 to make it about a micron on a side 
then the resonant frequency would be raised by 10 4 . 
The Q at this new frequency has been decreased by 
10 4 , and would be reduced by 10 8 at the original 
frequency. A search for new filtering methods is 
in order. R-C filters, and methods of producing 
effective inductance could be used, but their sta- 
bility has never been adequate. This new filter 
may be the logic of a pattern recognition machine. 
Serial communication filtering could be handled by 
sample data techniques coupled with appropriate 
quantization and logic circuits — in other words, 
the digital filter. In short, the communication 
set of the future would become a computer. These 
techniques would not be thrifty in their use of 
components and it should be expected that vast 
numbers of components would be needed. 

In addition to an increased loss for tuned 
circuits, we find that the same mechanisms are at 
work to cause excessively lossy transmission lines. 
For example, a 3000-angstrom-diameter line having 
a length of one inch would have 100,000 ohms DC 
resistance. The added high frequency loss of the 
line would be quite severe. A clear dictate for a 
micro-component wired to its neighbors in the 
classical way is that it be a high impedance com- 
ponent. In general, electric field operated de- 
vices like tubes and transistors have high imped- 
ance while magnetic components have low impedance. 
This high impedance does not necessarily mean long 
switching times in microelectronics. Consider 
that the distributed capacity of a one-micron de- 
vice may be as high as 10 -15 farad, and if this is 
associated with a 100,000-ohm resistor a 10~ 10 
second time constant results. 

Properties of Materials 

We should now look into the properties of 
materials in the sub-micron region. The resisti- 
vity of conductors is very near that of bulk 


materials for^a 3000-angstrom-diameter trans- 
mission line, but we are encouraged to find that 
dielectrics can have breakdown strengths several 
orders of magnitude higher than bulk materials. ^ 
Breakdown values of 3 x 10^ v/cm for a 2000- 
angstrom film of aluminum oxide is typical. This 
is primarily because the thickness is near the 
value for the mean free path of an electron in the 
dielectric, and an avalanche is not allowed to 
occur under such conditions and breakdown is fore- 
stalled. One should be motivated to look into 
high field strength electrostatic devices in order 
to take advantage of this effect. 

Some of the mechanical properties of micro- 
structures seem to make things difficult for us. 
For example, it seems much more difficult to grow 
a highly ordered single crystal film than a bulk 
single crystal, especially on an amorphous or 
micro-polycrystalline substrate that may be re- 
quired for multi-layer component construction. 

Surface tension forces play a very important 
role in certain sub-micron configurations ; these 
powerful forces try to form our carefully shaped 
parts into tiny spheres which may not be use- 
ful. 3 ’ 4 

When the quantity of material in our compo- 
nent is reduced to the level required, extreme 
precautions must be taken to prevent either self- 
diffusion or diffusion of foreign materials. 

These microelectronic components can no longer 
average out the effects of foreign material. 

"Well, I haven’t looked that closely "but the 
thing that does surprise me is that the effects 
are as big as they seem to be if indeed they are 
caused by what I suppose." (anonymous contribu- 
tion to discussion on surface phenomena) 

As the size of our device is reduced the 
surf ace-to-volume ratio rises and surface recom- 
bination effects may become very difficult to cope 
with. 

This increase of surface-to-volume ratio 
brings with it a fantastic cooling mechanism for 
individual micron-sized components. It has been 
found in relay contact research, and particularly 
by x-ray micro techniques 5 , that the power densi- 
ty into a one-micron area can be 10 8 watts per 
square centimeter continuously without material 
decomposition. Of course, it would not be expec- 
ted that many components in the same area could 
operate at this level without catastrophic effects. 

Tunnel Effect Vacuum Tube 

We propose to take advantage of some of the 
effects found in the micro-world and incorporate 
them into a component based upon the quantum 
mechanical tunneling of electrons into vacuum. 
Figure 1 shows one configuration that may prove 
interesting. 

The operation of this device may be divided 
into two parts to help our understanding of it — 
namely obtaining electron emission, and using 
these emitted electrons . 

The cathode properties have been investigated 
by many workers and there is general ’agreement 
that the current density can be 10^ amps per 
square centimeter with only very small space 
charge effects. 6 This is due to the high field 
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strength of a few times 10 volts per centimeter. 
The current from this source can be varied over 
seven powers of 10 by a change in field of 2 to 1 
at the cathode, implying high gain possibilities. 
Tests at 10,000 megacycles have verified that 
there is no detectable deviation from the DC 

7 

emission characteristics. 

The variation of grid voltage would cause the 
necessary field change at the cathode. A measured 
velocity distribution of 0.14 electron volts shows 
that this is not a noisy source.® 

Our intentions are to form an array of small 
tips superimposed on the cathode shown in Fig. 1. 
These tips should be a few hundred angstrom units 
in diameter in order to obtain emission below 100 
volts when applied to the screen grid or anode. 

The cathode shown in Fig. 1 is nominally 3000 
angstrom units wide. 

Electron optical considerations dictate how 
the emitted electrons are to be used. A positive 
control grid would normally be used, but this 
would not draw appreciable current because it is 
located away from the electron path between 
cathode and anode. Negative grids are possible if 
they are smooth enough or have high work function 
so as to prevent emission. It is desirable to 
collect the electrons at a low plate potential, 
thus avoiding unnecessary heating. Potentials as 
low as a few volts would seem possible because the 
field strength would still be sufficient to allow 
collection of electron current densities of 
greater than 10,000 amps per square centimeter 
without adverse space charge effects. 

The screen grid would remain at some level in 
the 100-volt range acting as a bias to help cause 
field emission, but not collecting an appreciable 
current . 

The upper limit in switching speed for this 
device would be set by the allowable power dissi- 
pation. Using an anode voltage of 10 volts, a 
current of 100 microamperes, a capacity of 10~-*-^ 
farad, then this one-micron-sized device would 
show a switching time constant of lO - ^-® seconds 
for a power density of 10^ watts per square centi- 
meter — well within the allowed value for a single 
component . 

Our completed machine composed of tunnel 
effect elements could have a maximum input power 
of 100 watts — as determined by heating considera- 
tions. The one milliwatt of input per device 
would allow 10 5 devices to operate simultaneously 
with a 10~10 second switching time which gives a 
bit rate of 10 ^ p er second. 

The transit time for electrons would be about 
10 -13 seconds, and this would seem to remove the 
need for complicated traveling wave devices to 
obtain high frequency gain. 

Environment . Tunnel effect cathodes have been 
found to be insensitive to variations in tempera- 
ture between 4°K and 2000°K. The complete tunnel 
effect device would seem to be similarly indepen- 
dent of temperature although a high temperature 
limit is clearly predicted by conduction within 
the dielectric. 

Mechanical forces would not likely effect 
this microscopic device unless a physical crack 
resulted in the construction materials. 
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Ionizing radiation does not affect the cathode 
properties or other metal electrodes until the 
dielectric has been severely damaged. The choice 
of dielectrics for this device is very wide, but 
materials like aluminum oxide or beryllium oxide, 
in film form, would be chosen because of their 
stable characteristics under bombardment. 

Reliability . It is obvious that a device is 
perfectly reliable if none of the material compos- 
ing it is allowed to move or migrate. This condi- 
tion for stability is most easily met by using only 
refractory metals and dielectrics and removing all 
foreign materials from the system. Tunnel effect 
devices need only one type of conductor and one 
type of dielectric; these materials are chosen 
primarily for their stability, and no compromise 
has to be made to win some electrical effect. In 
addition to stable components, it is necessary to 
have dense layers of encapsulating materials to 
prevent migration of materials from the outside 
surfaces into the active areas. Microscopic vacuum 
and solid state devices suffer to an equal extent 
from this migration. To assist the application of 
dense outer layers it is beneficial to use high 
temperature processing for encapsulation. 

Several thousand hours of operating life is 

presently obtained with gross field emission de- 

7 10 

vices operated at 10' amps per square centimeter. 

The cause for deterioration is inevitably due to 
contamination of the cathode or sputtering of the 
cathode with ions formed from gas contaminants that 
migrate into the area. By reducing the area of the 
container, a large source of contamination is re- 
moved. Lower operating voltages reduce the sput- 
tering effects, and a clean construction process 
would remove the remaining offenders. The field 
emission cathodes now in existence are the only 
experimental examples the electronics industry has 
which operate with the microscopic areas of 
10 -10 cm 3 that we anticipate using. We should be 
warned that the stability problem for all devices 
of equivalent area is severe, and then take steps 
toward clean construction methods. 

Reproducibility . Vacuum tunnel effect devices 
have a very high probability of being self-formed 
into vast arrays having uniform electrical proper- 
ties. This seems to be a unique advantage not 
shared by many types of components. The principal 
requirement for device self -formation is that a 
chemical process be effected by a significant 
electrical property of the component in such a way 
that the component is altered to a new form. The 
simplest illustration of this effect is the forma- 
tion of an electrolytic capacitor in which the 
voltage controls the dielectric thickness. Addi- 
tional requirements for self-formation are that all 
chemical residue may be driven from the component 
and that the uniformity will not be altered by this 
material being removed. 

Cryogenic devices seem to have a negligible 
possibility of self-f ormation ; magnetic devices a 
somewhat higher possibility, but still fairly low; 
semiconductor devices would be next in line; and 
then vacuum devices, which would be the easiest 
group because of the constant accessibility of the 
electronic surface being formed to the chemicals 
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responsible for forming. It is advantageous to 
have an electronic device that is insensitive to 
temperature variations for self-forming methods, 
because all too frequently chemical action cannot 
be made to take place at the component operating 
temperature. Once again vacuum devices, and es- 
pecially tunnel effect vacuum components show 
their advantages. 

In the self-formation of our tunnel effect 
components, the device would be made as near as 
possible to the final shape by electron beam acti- 
vated micromachining with the many small cathode 
tips being formed too sharp for use. A voltage 
would be applied to all electrodes simultaneously 
in the presence of a molecular beam etchant and 
at an elevated temperature. The voltage would be 
raised until the sharpest tips began emitting. 
These emitted electrons have been found to accel- 
erate the etching of small tips which in turn 
tends to increase the tip radius and reduce emis- 
sion. By gradually increasing the applied voltage 
up to the operating value, all of the excessively 
sharp tips are degenerated to a constant emission 
value. The entire array of components would now 
be heated under ultra high vacuum conditions and 
sealed with encapsulating layers of materials. 

This self-forming process has taken into consid- 
eration the pertinent spacing variations as well 
as the cathode properties. 

Interconnection . We intend to employ vacuum 
tunnel effect devices in a system that uses only 
optical input converters, electroluminescent gen- 
erators, and a means for interconnecting compo- 
nents. No lumped capacitance, inductance, or 
resistance seems needed. 

Since complete vacuum tunnel effect devices 
do not exist, detailed electrical characteristics 
are not known and it would be a waste of time to 
develop elaborate circuits for this hypothetical 
device. Still, there are many things that can 
guide us toward the eventual circuits. 

Direct coupling between devices is facilita- 
ted by the positive grid characteristics. It has 
been found that the voltage-vs .-current character- 
istic of a vacuum tunnel diode behaves like a 
normal diode with a battery in series with it, 
thus forestalling conduction until the voltage has 
been raised appreciably. This bias-battery effect 
is also beneficial for direct coupling. 

When an output is sought that takes either 
one of two states — B+ or ground — two tetrodes are 
used in series across the power input, one serving 
as a switchable load resistor for the other. In 
the type of microelectronics described here, every 
effort should be made to conserve energy, and the 
arrangement just described allows the capacity of 
the switched point between the two devices to be 
charged or discharged quickly without using a low 
value load resistor that requires the constant 
dissipation of energy for one of the switched 
states . 

If resistors are excluded from our circuits 
then it will be found difficult to discharge grids 

under certain conditions. Here we would like to 

12 

invoke the use of secondary emission. > The 
time delay from secondary emission surfaces is so 
short that it has never been measured. The cur- 


rent density available is arbitrarily high, up to 
the point that space charge or heating effects set 
in. The stability for properly cleaned surfaces 
would be the same as for equally clean field emis- 
sion cathodes; thus we may feel free to employ 
this effect in clean microelectronic systems. The 
effect we seek is to drive a control grid positive 
when it receives a burst of electrons with enough 
velocity to exceed the secondary emission one to 
one point. In this way, a floating electrode may 
be driven positive or negative by changing the 
velocity of the electrons or by deflecting a beam 
of constant velocity to either the high or the low 
secondary emission areas of the electrode. 

We would employ active memories of the nega- 
tive resistance or dynatron type using secondary 
electron emission in this microelectronics system. 

A memory device of this type can have an almost 
arbitrarily low power consumption — limited by the 
leakage of the dielectric. The storage capacity 
of this memory must be charged and discharged by 
very energetic devices such as tunnel effect 
tetrodes in order to operate in the 10 -10 second 
region. 

As an alternate memory method we could use 
flip-flops having additional tunnel effect tetrodes 
to replace the conventional plate load resistors. 
The proper feedback to the variable load resistors 
serves to provide a low impedance for quick char- 
ging and a high impedance for the quiescent 
period. 

These memory configurations reduce the cur- 
rent during the quiescent period to a value just 
necessary to prevent dielectric leakage from 
changing the stored value. A large array of these 
elements in quiescent state would draw a current 
equivalent to a slightly leaky capacitor of com- 
parable area and thickness. Accidental removal of 
the voltage source from the system would have no 
immediate effect. 

In future systems we shall want to dispose of 
some of the wires for interconnecting devices . 
Besides being a lossy means for conveying informa- 
tion in the micro-world, wires are a very inflexi- 
ble arrangement for connecting components. We 
would like to have a completely uniform array of 
low complexity modules that are not interconnected 
in the beginning. A "path" should be built as the 
machine learns, and this path should be built by 
entirely electronic methods making efficient use 
of the components. An electrically steerable, 
periodically focused electron guide, sometimes 
called slalom focusing, 13 seems to answer our 
needs. This system makes use of the ballistic 
properties of free electrons and is able to guide 
bundles of electrons around devious corners, cross 
beams in the same plane without interference, ob- 
tain persistent current loops, and potentially 
cause certain beam-beam interactions that can do 
logic. 

One configuration that appears useful is to 
locate our periodic array of modules — capable of 
launching, receiving, and steering electrons — be- 
tween two conductive planes at cathode potential. 
This layer of components would then contain about 
half as much material as the wired method. Voids, 
for the potential electron paths, would replace 
the wires. The potential of the outer electrode 



of our module would guide the electrons, the 
potential being determined by an internal memory 
and past history. The module would be expected to 
carry out steering functions and logical functions 
independently so as to increase the logical free- 
dom of the system. 

Once again our high field strengths would 
provide the means for suppressing space charge ef- 
fects . The transit time for a 100-volt beam would 
be about one hundredth the velocity of light — not 
a bad price to pay when the dispersion effects on 
a lossy transmission line are considered. 

Having once entered the electron beam realm 
we are able to use parametric amplifier techniques 
and achieve low noise amplifiers in spite of high 
lattice temperature. 14 

Light Coupling. For coupling light into our 
system, we will invoke nothing more sophisticated 
than a photosensitive surface connected to the 
grid of a tunnel effect amplifier, either directly 
or through secondary emission electron multipliers. 

For output displays, field emitted electrons 
can cause DC electroluminescence effects, or in a 
less advanced scheme simple bombardment of a film 
of phosphor will suffice. If our systems are to 
be immune to temperature effects up into the red 
heat range, blue emission would seem most interes- 
ting. A blue sensitive photo surface has a high 
work function and can be stable in this tempera- 
ture range . 

Construction Process Requirements 

We now need a fabrication method to make our 
devices. Although vacuum tunnel effect devices 
have been emphasized we do not intend to exclude 
any known electronic component from our repertoire. 

It may seem at first that we have taken on 
some especially difficult problems like vacuum en- 
capsulation, but a closer look reveals that this 
is no more difficult than filling in unused space 
in a solid state system. Present film methods do 
not see the problem because the edges of the film 
are so diffuse that a grading effect occurs be- 
tween adjacent elements. When components are made 
that have micron dimensions on all sides, high 
resolution filling of voids is necessary to pre- 
vent a self-induced pin-hole effect. Converting a 
small open cavity into a small closed vacuum cavi- 
ty can be done as easily as filling voids in one 
layer so as to smooth the surface, before pro- 
ceeding to the next layer. This smoothing seems 
necessary regardless of the component used. 

By being able to support a very thin film in- 
side of a vacuum cavity, infrared or thermal de- 
tectors could be partly isolated from the lattice. 
Mechanical devices like electrostatic relays or 
acoustical filters also need this isolation. 

Material Deposition . We have elected to use 
vacuum deposition methods to obtain our starting 
materials because this is a convenient method of 
producing a wide range of clean materials . Ther- 
mal evaporation has been used to produce photo 
conductors, phosphors, magnetic materials, super- 
conductors, metals, dielectrics, and semiconduc- 
tors. Reactive deposition methods have produced 
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the same group of materials, but have superior 
crystalline properties and greater stability. Re- 
active deposition is usually carried out by reac- 
ting some metal halide with a reducing or an 
oxidizing agent. 

These reactions are most conveniently carried 
out in a high vacuum system as shown schematically 
in Fig. 2. For the deposition of molybdenum, one 
of the evaporation sources becomes molybdenum 
pentachloride and the other is hydrogen, made by 
thermally decomposing a material like zirconium 
hydride. The evaporation rate of these materials 
is monitored by separate ion gages and adjusted by 
external regulators. Upon reacting at the heated 
substrate, molybdenum is deposited and hydrogen 
chloride is pumped away. By this same method, 
aluminum oxide films can be formed when aluminum 
chloride and water vapor are reacted. Pi"n hole 
free deposits are obtained because there is no ash 
or lumps from the sources that are not immediately 
volatilized at the surface, and in addition the 
high mobility of the molecules on the surface do 
not cause shadowing effects from dust specks. 

There is less strain in the deposited films since 
the substrate can be much hotter for reactive de- 
position than for a conventional thermal evapora- 
tion. This higher temperature is possible because 
there is a mechanism for carrying away the energy 
of the condensing molecule when the volatile 
product of the reaction leaves the surface, Reac- 
tively deposited materials can be graded to the 
underlying film in an optimum fashion — not so 
little that the films tear or peel, and not so much 
as to ruin the sensitive electronic surfaces. 

Doping can be added to semiconductors with reason- 
able assurance of going in properly. 

Material cannot be accurately localized by 
masking at the surface with reactive deposition, 
as it can with thermal evaporation. We are thus 
led to seek a method of selective removal for the 
films that are produced. 

Material Removal . Any film that can be de- 
posited can be etched away in the same vacuum sys- 
tem by using a molecular beam of the proper com- 
pound or element. If one of the evaporators in 
Fig. 2 becomes a source for chlorine, and the pre- 
viously deposited molybdenum film is heated to 
around 500°C, the film will be converted into 
molybdenum chloride which evaporates from the sur- 
face. Aluminum oxide films can be etched in a 
phosgene molecular beam. 

Etching in a vacuum system carries many advan- 
tages with it. For example, if etching effects are 
desired that cannot be obtained with molecular 
beams, then atomic beams can be used. The lack of 
surfaces between the source and the substrate pre- 
vent recombination of the atomic species. A sput- 
tering etch can be easily carried out in the same 
apparatus . 

It is desirable to have an etching reaction as 
efficient as possible to produce the cleanest sam- 
ples. Any excess materials, like carrier gases, 
coming into contact with the surface raise the 
possibilities of contamination. 

The Resist . It will be found that certain 
thin layers of material can protect the film below 
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them during etching and thus become a resist. 
Amorphous silica has been used as a resist for the 
selective etching of molybdenum and aluminum oxide 
as well as many others. This silica layer need be 
only about 50 angstroms thick for the protection 
of several thousand angstroms of underlying film 
material. After serving its purpose as a resist, 
the silica can be removed with a brief hydrogen 
fluoride molecular beam etch. Many resists are 
available, and simple chemical selection rules can 
determine which one is the best choice for any 
particular requirement. 

The resist layers can be thermally evaporated 
onto a surface through masks, but this would not 
serve our purposes of having a high speed, elec- 
tronically controlled, high resolution process. 

We have found ways of producing a silica resist by 
bombarding certain siligon-containing organic com- 
pounds with electrons. 1 The most common example 
is the bombardment of tetraethylorthosilicate . 

This substance is a liquid at room temperature and 
evaporates to the substrate where it probably 
forms a multi-molecular adsorbed layer. When this 
film is bombarded for about one second with a beam 
of 10 milliamperes per square centimeter current 
density, enough silica is produced for our pur- 
poses. The mechanism for silica production is not 
known in detail, but it is assumed to be a free 
radical polymerization of the organic followed by 
decomposition to silica upon heating to etching 
temperatures. 

This gas-like compound is not easy to handle 
because it tends to enter the electron lens and 
contaminate the surfaces. An alternate material 
that also produces silica, but is easier to handle, 
is triphenylsilanol. This is a solid that is 
evaporated to the surface and then exposed as des- 
cribed above. Immediately after exposure no 
change is evident, but heating causes the unex- 
posed areas to evaporate and the exposed portions 
to remain as a silica-like deposit. 

The efficiency of this reaction is such that 
an average of one molecule of silica is produced 
for each electron. More sophisticated methods of 
resist production have given yields of over 10 3 
molecules per electron by causing current multi- 
plication within the sample being exposed. 

The resolution of resist production has been 
seen to exceed 100 angstrom units. This is ob- 
served by exposing a surface of resist producing 
material with an oblique flood electron beam in 
such a fashion that shadows are cast by objects on 
the surface. These objects are usually polysty- 
rene spheres about 880 angstrom units in diameter. 
An electron microscope is used to view the result. 

Electron-Beam Activated Micromachining . It 
can be seen that electron-beam micromachitting is 
the combination of certain methods of deposition, 
resist production, and etching. Our initial re- 
quirement for this fabrication process was to 
economically produce a one-cubic-inch system of 
10 11 parts being composed of 100 modules; each 
having 10 layers of 10 8 components per layer. How 
much time would it take to construct the system? 
Let us find how long it would take to make a 
single layer of 10 8 components, and then multiply 
by 1000 . 


The substrate must first be very carefully 
prepared by grinding, polishing, preparing ter- 
minals, vacuum firing, re-polishing, and then 
final cleaning and smoothing in the ultra high 
vacuum processing chamber. The substrate can be 
an expensive item. 

For each layer of components there may be an 
average of four depositions and four etchings re- 
quired. About five minutes per deposition and 
etching is possible, which includes time for 
heating and cooling. This totals forty minutes so 
far. 

g 

The 10 components per layer and their inter- 
connections can be represented as an array of 
about 10 11 300 angstrom unit dots. This is ar- 
rived at by remembering that each component is 
composed of 10 3 pieces — our basic building blocks. 
It has been found that high resolution electron 
optical systems can easily have 10 8 bits in one 
field of view— a consequence of the small aperture 
angles used. A gross image could thus contain 
10^ complete components per field. We will be re- 
quired to mechanically move the substrate in front 
of the lens to produce our latent-resist image. 
1000 steps are needed to make 10 8 components. 

Each exposure can be made as rapidly as the elec- 
trical and mechanical system is able to servo and 
register cn .ie preceding layers — about one tenth 
of a second. The high current density of the 
electron beam is able to expose the image in a 
negligably short time. One thousand steps would 
take about two minutes per layer or eight minutes 
for four layers. We have now accumulated 48 min- 
utes to process one layer of components. 

The production of an entire machine would 
take 1000 times 48 minutes or about 800 hours. 

We have elected to use an electron beam to 
convey information to a surface to describe the 
patterns of material that will be successively 
built up to form a module. Besides the obvious 
advantage of having short wavelength, electron 
beams have extremely bright sources — i.e., many 
events per unit time; they are electrically steer- 
able, and they interact well with matter. Beams 
have been used to analyze the properties of matter 
such as absorption spectra, crystalline diffrac- 
tion effects, chemical analysis bv x-ray fluores- 
cence, 1 ^ geometrical arrangement, 18 contact poten- 
tial, and surface cleanliness. 13 In addition, 

beams have been used in electron mirror micro- 
20 

scopes to show voltage, magnetic field, and re- 
sistivity. We intend to integrate all of these 
functions into a single electron optical system 
so that our machine tool becomes an analytical 
tool as well. 

Electron Optics . The electron optical system 
employed must integrate the functions of a scan- 
ning electron microscope, scanning x-ray fluores- 
cence probe, and mirror microscope. This instru- 
ment is required to operate under ultra high 
vacuum conditions so that conventional electron 
microscope designs having many rubber gaskets will 
not do. We have elected to make an all-ceramic 
electrostatic instrument, with metallized sur- 
faces, in order to obtain the necessary mechanical 
stability during the vacuum bake-out cycle. 

When completed, this instrument should be 



capable of machining or viewing structures down to 
a limit of 100 angstrom units resolution. The x- 
ray fluorescence probe could measure film thick- 
ness to one tenth percent accuracy and be able to 
carry out a quantitative chemical determination 
on 10 -13 grams of material to one percent accuracy. 
The mirror microscope feature could measure the 
voltages on any element of our components that lay 
on the surface. This voltage measuring method 
could be converted by simple electrical switching 
into a very wide band oscilloscope to show the 
dynamic behavior of the components . 

In addition to the functions of magnification, 
demagnification, manipulation, and electron and 
x-ray detection, the electron optical system must 
have a pattern generator integrated with it. The 
simplest means of conveying information to the 
surface is to use a scanning electron beam, but it 
will be found that this method is severely band- 
width and energy limited for complex patterns. 

If a perfectly repetitive pattern is required 
for each of our 1000 exposures per layer, then a 
simple large-scale mask is needed near the elec- 
tron source. If some sub-module exists in irregu- 
lar order throughout our system then this can be 
obtained by a mask and then be deflected electri- 
cally to the desired location. 

Since this electron beam machining process 
requires such a low current density, purely elec- 
trical methods of pattern generation may be in- 
voked. A slowly scanning input beam can write a 
charge pattern on a storage screen which in turn 
controls the transmission of electrons . Ultimate- 
ly, we would vise a vacuum tunnel effect cathode 
array for our electron source. The emission from 
discrete areas would be controlled by local grids 
connected to micro-memory elements which are set 
by a slowly scanning beam or a microelectronic 
stepping system to write in the pattern changes 
that are required. Thus we have components made 
by electronic micromachining responsible for the 
building of new systems by the same method. In 
the end, self reproduction would be a distinct 
possibility without the use of a lens system, be- 
cause all copies would be made on a 1 to 1 size 
basis. 
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SUMMARY 


The paper first discusses an example of a com- 
puter, intended as a prototype of a practical com- 
puter, having an iterative structure and capable 
of processing arbitrarily many words of stored 
data at the same time, each by a different sub -pro- 
gram if desired. Next a mathematical characteriza- 
tion is given of a broad class of computers satis- 
fying the conditions just stated. Finally the 
characterization is related to a program aimed at 
establishing a theory of adaptive systems via the 
concept of automaton generators . 


I 

INTRODUCTION 


Computers constructed of hundreds of millions 
of logic and storage elements will require an or- 
ganization radically different from present com- 
puters if the elements are to be used efficiently 
in computation. It should be possible to process 
arbitrarily many words of stored data at the same 
time, each by a different sub -program if desired. 

In addition the structure of the computer should be 
iterative or modular in order to allow efficient 
use of template techniques in its construction. 

The present paper contains a mathematical char- 
acterization of a broad class of computers satisfy- 
ing these conditions. This class, with appropriate 
interpretation of the symbols, includes representa- 
tives structurally and behaviorally equivalent to 
each of the following types of automata: 

1) Turing machines (with 1 or more tapes 

2) Tessellation automata (Von Neumann, 

Moore) u' 

3) Growing logical nets (Burks -Wang )2 A 

4) Potentially-infinite automata (Church)3 
The class also contains automata which, in various 
senses, are generalizations of each of these four 
types . 

Ultimately, for the designer, the value of such 
a characterization depends upon whether or not it 
can actually suggest designs for solid-state com- 
puter. Section II of this paper discusses a possi- 
ble abstract prototype for such a computer — it is 
one of many alternatives which can be defined and 
investigated with the help of the characterisation. 
(A similar computer is discussed in greater detail 
in the Proceedings of the 1959 Eastern Joint Com- 
puter Conference )5. Section III summarizes the 


mathematical characterization and considers its 
interpretation. Section IV relates this paper to 
a program (begun by the author in 1958 ) which has 
as its objective a theory of adaptive systems. 


II 

AN ITERATIVE CIRCUIT COMPUTER 

The computer outlined in this section is pre- 
sented primarily to suggest something of the class 
of computers included in the characterization sum- 
marized in the next section. At the same time, 
however, the order code, addressing schemes, etc. 
where chosen to reflect their counterparts in 
present computers. In this sense, the computer 
can also be thought of as a prototype of a practi- 
cal computer — assuming that the large numbers of 
components required can be provided economically. 
Because the computer can execute an arbitrary 
number of sub -programs simultaneously, and because 
the sub -programs are spatially organized, its 
operation is of course considerably different from 
present computers . 

The computer can be considered to be composed 
of modules arranged in a 2-dimensional rectangular 
grid; the computer is homogeneous (or iterative) 
in the sense that each of the modules can be rep- 
resented by the same fixed logical network. The 
modules are synchronously timed and time for the 
computer can be considered as occurring in dis- 
crete steps, t = 0, 1, 2, . . .. 

Basically each module consists of a binary 
storage register together with associated circuitry 
and some auxiliary registers. At each time -step a 
module may be either active or inactive. An ac- 
tive module, in effect, interprets the number in 
its storage register as an instruction and pro- 
ceeds to execute it. There is no restriction 
(other than the size of the computer) on the num- 
ber of active modules at any given time. Ordinari- 
ly if a module M(i,j) at coordinates (i,j) is ac- 
tive at time -step t, then at time -step t+1, M(i,j) 
returns to inactive status and its successor, one 
of the four neighbors M(i+l,j), M(i,j+l), M(i-l,j), 
or M(i,j-l), becomes active. (The exceptions to 
this rule occur when the instruction in the stor- 
age register of the active module specifies a dif- 
ferent course of action as, for example, when the 
instruction is the equivalent of a transfer 
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instruction) . 

The successor is specified by bits Sq, s 2 in 
M(i,j)'s storage register. If we define the line 
of successors of a given module as the module it- 
self, its successor, the successor of the succes- 
sor, etc., then a given sub-program in the com- 
puter will usually consist of the line of succes- 
sors of some module . Since several modules can be 
active at the same time the computer can in fact 
execute several sub -programs at once. We have 
noted parenthetically that there are orders which 
control the course of action — there are also or- 
ders equivalent to store orders which can alter 
the number (and hence the instruction) in a stor- 
age register. Therefore, the number of sub -pro- 
grams being executed can be varied with time, and 
the variation can be controlled by one or more 
sub -programs . 

The action of a module during each time -step 
can be diveded into three successive phases: 

1) During phase one, the initial phase of 
each time-step, a module's storage register can be 
set to any arbitrarily chosen value and its auxil- 
iary registers to any desired condition. The num- 
bers and conditions thus supplied are the computer's 
input. Although the number in the storage register 
can be arbitrarily changed at the beginning of each 
time-step, it need not be; for many purposes the 
majority of modules will receive input only dur- 
ing the first few moments of time ("storing the 
program") or only at selected times tq, t 2 , . . . 
("data input"). Of course, some modules may have 

a new number for input at each time -step; in this 
case the modules play a role similar to the inputs 
to a sequential circuit. 

2) During phase two, an active module deter- 
mines the location of its operand set, the set of 
storage registers upon which its instruction is to 
operate. This the module does by, in effect, 
opening a branching path (sequence of gates ) to 
the operands. The path-building action depends 
upon two properties of modules: 

First, by setting bit p in its storage regis- 
ter equal to 1, a module may be given special sta- 
tus which marks it as a point of origination for 
paths; the module is then called a P-module. 

Secondly, each module has a neighbor, distinct 
from its successor, designated as its predecessor 
by bits qq, q 2 in its storage register; the line 
of predecessors of a given module Mq is then defin- 
ed as the sequence of all modules [Mg, Ml, . . . ,Mj5-, . . . ] 
such that, for each k, Mq- is the predecessor of 
M k“l and M k~l ls ’ fche successor of M k . Note that 
the line of predecessors may in extreme cases be 
infinitely long or non-existent. The line of pre- 
decessors of an active module ordinarily serves to 
link it with a P-module (through a series of open 
gates ) . During the initial part of phase two the 
path specification bits y Q , . . . ,y n and dg, . . . ,d^ in 
the storage register of an active module Mq, are 
gated down its line of predecessors to the nearest 
P-module (if any) along that line. The path speci- 
fication bits are then used by the P-module to op- 
en a branching path to the operand set of the act- 
ive module. 


Each path must originate at a P-module. The 
modules belonging to a given path can be separated 
into sub -sequences call segments. Each segment 
consists of y modules extending parallel to one of 
the axes from some position (i,j) through posi- 
tions (i+bq, j+b 2 ), (i+2bq, j+2b2)>. . . , 
(i+(y-l)bq, j+(y-l)b 2 ), where bq = ± 1 or 0 and 
b 2 = ± (l-bq); the module at (i+ybq, j+yb 2 ) will 
be called the termination of the segment. Each 
module possesses four *-registers and if the module 
belongs to a segment in direction (bi,b 2 ) the ap- 
propriate * -register, (bq,b 2 )*, is turned on gat- 
ing lines between (i,j) and (i+bq, j+b 2 ). Since 
each * -register gates a separate set of lines, a 
module may (with certain exceptions ) belong to as 
many as four paths . Once a *-register is turned 
on it stays on until it is turned off; thus a 
path segment, once marked, persists until "erased". 

Each segment of a path results from the com- 
plete phase two action of a single active module; 
however, since a path may branch, more than one 
segment may result in one time -step from the act- 
ion of a given active module . After the digits 
y n , • • • •>yo 5 are S a ^ e< ^ to the nearest 

P-module along the line of predecessors of the 
active module, new segments are constructed at 
the termination of each branch of the path origi- 
nating at the P-module. Note that, because of the 
branching, there will be more than one path termi- 
nation . 

Branching is controlled by the digits dj, . . . , 
dg. To each of the four digits dj, ...,dg corres- 
ponds one of the four neighbors at each branch 
termination. If dq=l then, when the path is ex- 
tended, at each existing path termination a new 
branch will be sent through the ith neighbor paral- 
lel to the axis . 

Path extension takes place only when bit y n =0; 
then bits yn-i> • • • ^yo determine the common length 
of the new segments and bits dj,...,dg determine 
their directions. If y n =l then final path seg- 
ments, if any, in the directions specified by 
dj, . . . ,dg are erased (bits y n _q, . . . , 7 q not being 
used in this case). In order to prevent inter- 
ference of one path with another, or with itself, 
a set of priority and interlock rules are requir- 
ed. These rules will not be specified here but 
the interested reader can see a complete set of 
such rules for a similar computer in the 1959 
E.J.C.C. paper cited previously. 5 

3) During phase three, an active module exe- 
cutes the instruction contained in its storage 
register. This involves the following modules: 
the active module itself holds the order code in 
bits i 2 , iq, ig of its storage register; the stor- 
age registers of the modules terminating the near- 
est path contain the set of words to be operated 
on (the operand set); finally there must be a mod- 
ule which serves as arithmetic unit. In order to 
serve as an arithmetic unit, bits (p,a) in the 
storage register of a module must first be set to 
the value (0,l), giving the module special statps — 
A-module status. (Note that this means a module in 
P-module status, p=l, cannot be an A-module). If 
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M(i,j) is an active module then the first A-module 
along its line of predecessors serves as the 
arithmetic unit. 

A short , though representative , set of orders 
follows: 

(i) Execution of OR/ADD causes the following 
sequence of actions: first the numbers stored at 
the modules in the operand set are transferred down 
the branches of the path toward the P -module ; as 
these numbers meet at branch-points a resultant is 
formed equal to the bit-by-bit disjunction of the 
incoming numbers (i.e. bit j in the resultant is 1 
only if at least one of the incoming numbers has 1 
at position j); when the final resultant is formed 
at the P-module it is transferred along the line 

of predecessors to the nearest A-module; there the 
number is added to whatever number is in the stor- 
age register of the A -module . Note that this se- 
quence of actions takes place wholly within phase 
three of the time -step in which the instruction is 
executed. 

(ii) Execution of AND/ADD proceeds just as 
OR/AND except that a bit-by-bit conjunction (out- 
put bit is 1 only if all corresponding input bits 
are l) takes the place of bit-by-bit disjunction. 

(iii) Execution of STORE causes the number in 
the storage register of the nearest A-module to be 
transferred to the storage registers of all modules 
in the operand set. 

(iv) Execution of TRANSFER ON MINUS depends 
upon the number in the storage register of the 
nearest A-module. If, in this number, y n =0 then 
at the end of phase three the active module be- 
comes inactive and its successor becomes active. 

If y n =l then each of the modules in the operand 
set, rather than the successor, become active. 

(v) NO ORDER causes the execution phase to 
pass without the execution of an order. 

(vi) STOP causes the active module to become 
inactive without passing activity on to its suc- 
cessor at the next time -step. 

With the exception of the TRANSFER and STOP 
orders, the active module becomes inactive and its 
successor becomes active at the conclusion of 
phase three. Just as in the case of phase two 
some rules are required to prevent interference of 
active modules and to provide for cases where there 
is no nearest A- or P-module along the line of pre- 
decessors (the reader is again referred to the 
1959 E.J.C.C. paper) 5 . 


The storage register of each module in the pre- 
sent formulation consists of n+l 4 bits labelled in 
the following order: 


bit number: 

n+lA n+13 . . .14 13 12 11 10 9 8 7 6 5 4 3 21 

label : 

y n y n-l'" y 0 d 3 d 2 d l d 0 X 2 X 1 *0 S 1 S 2 \ ^2 P a 

The function, in the active module, of each bit 
group has already been discussed. 


Ill 

MATHEMATICAL CHARACTERIZATION OF 
ITERATIVE CIRCUIT COMPUTERS 

One purpose of the mathematical characteriza- 
tion summarized here is to define the class of 
iterative circuit computers precisely enough to 
allow mathematical deduction to be used in their 
study. This property of the characterization will 
be used in later work in an attempt at establish- 
ing a theory of adaptive systems (see the next 
section). At the same time the characterization 
can be used to generate a wide range of computer 
prototypes, each with different structural and 
operational characteristics. Thus, the character- 
ization can also be of help in the design of 
solid-state computers. 

The characterization is made up of the 
following parts: 

1) The positions of the modules are indexed 
by the elements of a finitely -generated abelian 
group, A. The particular group chosen determines 
the "geometry" of the network; for instance, by 
choosing the appropriate group, the modules can be 
arranged in a plane, or a torus, or an n-dimension- 
al cube, etc. Thus, for a computer with the mod- 
ules arranged in a 2 -dimensional rectangular grid 
1000 modules on a side, A would be the abelian 
group with two generators a 1 ,a 2 satisfying the 
relations 

1000 a-]_=e 
1000 a 2 =e 

where e is the identity element of the group. 

The group. A, is restricted to being a finitely- 
generated abelian group for several technical rea- 
sons . One reason is that the elementary theory of 
such groups is decidable. When taken with the 
rest of the definition of iterative circuit com- 
puters, this implies that the operation of the 
computer is effectively defined. Also any such 
group can be decomposed into a direct product of 
cyclic subgroups. Thus the elements of the group 
can be represented uniquely as n-tuples on the 
basis of certain sets of generators of the group 
(in other words, the modules are arranged in a 
"regular" fashion). 

2 ) In order to determine the immediate neigh- 
bors of a module we must specify a finite set, 

A° = (a-^, ...,a k ), of elements selected from the 
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group A. Then the set of immediate neighbors of 
the module at cte{A} are the modules at a^(o;) = 

Q+a^ for all a.e A°, where + is the group operation. 
For example, if we have a module at coordinates 
(i,j) relative to generators a-j_,a 2 and we wish its 
immediate neighbors to be at coordinates (i+1, j), 
(i, j+l), (i-1, j), and (i, j-l) then we could 

choose A° = {a-, ,a„,a“ P ,a“ Y } , where a7 P is the group 
inverse of a^ . 

3) The state of each module in the computer 
at each time t must be drawn from a finite set, S, 
of allowable states. S=XeY, the cartesian product 
of the sets X and Y. X can be any finite set of 
elements — the elements will be called "storage 
k 

states"; Y=II Yj_, a cartesian product of the sets 

Y i =R={a 1 ,^}a h( a k ,$z(}. 

In what follows the notation Sq will be used 
to denote the state of the module at position Ol at 
time t. A similar convention will be used for the 
components of S . Note that the elements of Y can 
be though of as k by k matrices. The matrix Yq 
which holds for the module a at time t is called 
the connection matrix of a at time t. The i^* 1 row 

of Y ai = (Y Sn^ Y ai2^'- jY aik^ speclfies whictl 
of the k immediate neighbors of a are connected 

through a to the module at ar Y (a); if Y~. .=a . then 

1 QhJ J 

the module at a. (a) is connected through Qt to the 
J 

module at Y (a) otherwise not. 

4-) Changes in the k rows Y^, . . . ,Y^ of the con- 
nection matrix Y from one time step to the next, 

% to Y Sf ', are determined by a set of k pro- 
jections: 


P i : S -> R = {a 1 ,^}a ... H{a k ,^}. 

The way in which these changes are effected will be 
described in terms of the transition equations to 
be giveh shortly. 


5) Change in the storage state from one time 
step to the next, Xq to X^*"'", is determined by a 


function 

k 

f : n S -» X 
i=l 


which will be called the "sub -transition function". 
Again f can be best explained in terms of the 
transition equations. 


A particular selection for each of the five 
parts described in l) through 5). (A,A° ,X, {P ± },f ) , 

determines a particular iterative circuit computer. 


In addition a function B : {A )sa£ t ) -•> S may be ef- 
fectively defined for some pairs (a,t); Bg, gives 
the input to a at time t, when B is defined for 
(Qjt ) . Once a particular iterative circuit com- 
puter is specified (and its initial state is given), 
the transition equations together with the function 
B determine the operation of the computer. 


The transition equation for the connection 
matrix y£, in terms of its elements, Y^. ., is: 

& oiy 

, if gj\ = ^ ("erasure") 


Y^t Y = <{ 

cdj 


- “ d = 0 


("no change") 


= p Sij > if p Sij = a j 


'Qij 

("construction" ) 

nth _.f> TJ /C3"t' 


where Pr. . is the j xn component of P. (S x ) 
CP-o ° i v cr 


q 3ij = 0 ’ if Vd * ^ 804 p ~ . * a . 


Pij 


= 1 , if P t = f 

Pij 

= &( \ J (P)jl,..., q a J 0)jk ) ^ 


otherwise 


defining &(c , . . . ,c ) = 1 , if all c .=1 
lk j 

= 0 , otherwise . 


As mentioned in 3) above, row i of the con- 
nection matrix Y& can be interpreted as specify- 
ing a set of modules a . (a) connected through a to 
J 

a” P (o:); for each such j, row j of the connection 
matrix Y a j( a ) may specify other modules a^a^ (a) 
connected, via a^(a) and then a, to a7 P (o:); 


appropriate rows of the matrices Y a^a-(a) 


may 


specify still others; etc. In other words the 
matrix Y^ tells how information is to be channeled 
through p to its immediate neighbors, the matrices 
for these neighbors tell how the information is to 
be sent on from there, etc. In this way each 
module serves as the base of what may be a complex 
branching tree channeling information to it. It 
will be seen (in the transition equations for Xq) 
that modules belonging to the tree for oc pass in- 
formation to a without a time -step delay. 


The transition equation for Yq, can now be given 
the following interpretation: Broadly, P^ (ie 
p i( S t)) specifies changes in row i of the connect- 
ion matrix Y^ while = (0^ • • • ,Q^ k ) prohibits 



certain of these changes. More specifically 
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= 0 prohibits construction of a new connection 

from aJa) through a to a7"*"(o:) at time t. &L . = 0 
J 1 cju. j 

just in case construction of a new connection is 
indicated somewhere in the tree for a. Some 
thought will show that this rule (others, at least 
superficially more general , could have been chosen) 
implies the following desirable conditions: 

(i) a cycle of connectionswithout delay cannot 
be formed (operation of modules belonging to 
such a cycle would in general be indeterminant — 
consider the analagous case of a set of one-way, 
non-delay switches arranged in a cycle) 

(ii) the tree for any given module a never in- 
cludes more than a finite number of modules 
(even if, for theoretical purposes, the group 
A is infinite). 

The transition equation for Xq; is: 

4 +1 ■ f S a k (0^) 

B U t) = V lf 

and Bq + 1 not defined 
P 


= B^ +1 , if Y^= (?(,..., 4) 


t+1 


and B t+1 e S 


•'‘ Vip )!! 1 

if Y^t 1 / 4) 


If Y^ + ^ = d then define S' . 


= S„ if B ( Q \ not defined 


= B^ + ^ . , otherwise . 

a i(j3)" 


Under interpretation the transition equation 
for Xq, specifies the storage state, ^t+1, i- n ‘ terms 

Q 

of the states at time t, S^, of the modules |3 be- 


longing to the connection tree for a. Note that, 
because of the recursive definition of S'(t), f 
may be iterated several times in the determination 


Xq — compare this to the determination of the out- 
put of a tree of switches without delays. 


TOWARD A THEORY OF ADAPTIVE SYSTEMS 


As already mentioned, the work reported here 
is part of a larger effort which has as its goal a 
theory of adaptive systems. The effort is an indi- 
vidual one and, of course, reflects particular 
biases of the author. This section will discuss 
the relation of the present paper to the broader 
program . 

The first step of this program was the descri- 
ption of a computer which could simulate the oper- 
ation and, in certain respects, the structure of 
any automaton (growing or fixed). The second step, 
summarized here in part III, consisted in giving a 
general and formal description of computers like 
the one first obtained — the iterative circuit com- 
puters . The resulting mathematical characteriza- 
tion represents a broad class of machines, of 
arbitrary geometries, etc.; by an appropriate 
choice of (A,A°,X,{P^},f ) it is possible to repre- 
sent directly not only the behavior but also the 
changing structure and local operation of any 
given potentially-inf inite automaton, tessellation 
automaton, n-tape Turing machine, or growing logi- 
cal net. (in this respect note, for instance, 
that the class of iterative circuit computers pro- 
perly contains A. Church's class of potentially - 
infinite automata — any two modules in an iterative 
circuit computer may eventually become connected 
so that either affects the other in a single time- 
step, whereas in a potentially -inf inite automaton 
the corresponding delay, in time -steps, increases 
with increasing separation and is a constant for 
any given separation).? 

For an iterative circuit computer the ideas 
of sub -program and automaton are, in an important 
sense, interchangeable. For any automaton, a 
sub -program can be written which not only has the 
same behavior but also the same changing structure 
and local operation. On the other hand, a given 
sub -program will in general occupy a finite number 
of modules in the computer and will have its action 
(or state) determined by bordering modules and the 
input function B. Thus, as a survey of the char- 
acterizing equations will show, an automaton or 
growing logical net can be constructed which 
mimics the sub -program. 

It is important to note that sub -programs can 
be set up which, for instance, can shift themselves 
from one set of modules to another set, i.e. from 
one position to another. Thus the underlying geo- 
metry of the iterative circuit computer (given by 
A and A° ) in effect determines the geometry of a 
space in which the sub -program is embedded; the 
transition equations then serve, in a sense, as 
the laws of this universe. For this reason the 
sub-programs of a given iterative circuit computer 
will often be spoken of as "embedded automata" . 

This view of the results of the second step leads 
directly to the third step. 
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The central object of the third step is to pro- 
vide formal apparatus for the implicit definition 
of automata — definition by means of generators and 
relations on these generators. Implicit definition 
of an automaton is analogous to the implicit defi- 
nition of an algebraic group. In the case of the 
group, instead of giving an explicit listing of 
the elements of the group and their interrelations, 
the group is defined (often quite compactly) in 
terms of a set of generating elements and relations 
on products of the generators. In a similar sense 
an automaton can be implicitly specified by an ini- 
tial set of elements and a set of growth rules. 

The mathematical characterization of iterative 
circuit computers provides the apparatus needed 
for a precise formulation of automaton generators. 
Specifically, the generators will be sub -programs 
which can be thought of as (relatively) elementary 
embedded automata having the following properties: 

(i) movement--the generators will in general 
be capable of shifting as a unit from one 
position to another (as specified by input 
B(a,t) or the state of adjacent modules --note 
that motion may be random if the input sequen- 
ces B(o;,t) are random), 

(ii) connection- -generators will combine 
under conditions specified internally (within 
the sub -programs ) to form larger sub -programs 
capable of moving and acting as units, 

(iii) production — generators can alter the 
state of adjacent modules (note that a suffici- 
ently complicated generator could directly 
duplicate itself ) . 

The generators will act upon other generators or 
other sub -programs present in the computer ("pre- 
cursors" ) by connecting them or breaking them into 
components . The generators will all be acting 
simultaneously and if a given generator duplicates 
itself the duplicate will also in general be active . 

So that any possible automaton can be defined 
in terms of the generators it is necessary to 
choose the set of generators so that any possible 
program for the computer can be represented by an 
appropriate connected set of generators (cf. the 
process of picking a set of instructions suffici- 
ent for a universal computer). Once this is done, 
the generation of particular automata can be ef- 
fected by controlling rates of production and 
connection, movement and contact, the nature of 
precursors present, etc. That is, the relations 
on the generators will consist of specifying the 
initial states and input sequences which control 
such factors. For any given iterative circuit com- 
puter and set of generators, the relations possible 
can be given an appropriate equational form. Under 
one approach, the way in which the generators are 
initially connected and the nature of the precur- 
sors in the "environment" are sufficient together 
to specify the generated automaton. Things be- 
come particularly simple if the generators cannot 
interpenetrate when moving (a kind of "billiard 
ball physics"). 


A given program, because of the loops or iter- 
ations, is much more compact than the complete 
sequence of steps in the calculations it controls. 
In the same sense the connected system of genera- 
tors which specify a given automaton will be much 
more compact than the automaton generated. Thus 
the automaton can have scattered throughout its 
structure complete implicit descriptions of its 
structure --such considerations play an important 
part in the study of self -repairing automata. 

In the implicit definition of an automaton 
certain feedback phenomena play a crucial role. 

The feedback phenomena can be to some extent iso- 
lated by observing at what level a given generator 
system falls in the following succession of cate- 
gories (each of which properly includes its 
successor): 

(i) productive systems — the generator system 
produces other generators or precursors, 

(ii) autocatalytic systems — the generator 
system produces generators or precursors 
which are used in its construction, i.e. the 
system produces some of its own components, 

(iii) self -duplicating systems — the generator 
system produces duplicates of itself. 

Such considerations lead directly from step three 
to step four and from work in progress to work 
which lies in the future. 

The central object of step four will be to 
define the term "adaptive system" for embedded 
automata. Because of the formal nature of the 
definition, it then becomes possible to investi- 
gate these adaptive systems deductively (and by 
simulation). The beginnings of such a definition 
lie in the following consideration: With the help 
of concepts such as autocatalytic and self -duplic- 
ating generator systems it is possible to define 
such concepts as steady-state equilibria and 
homeostasis for embedded automata. In fact one 
can go quite far in this direction obtaining dis- 
crete state relaxation processes (Southwell), 
morphogen standing -wave phenomena (Turing) and so 
forth . > 13 Automata exhibiting these properties 
will usually have the desirable property that 
small changes in structure result in small changes 
in behavior (at least over a certain range). Thus 
the behavior of a given automaton of this type 
gives some indication of the behavior of all auto- 
mata of similar structure ("hill -climbing" techni- 
ques become applicable). If the generator system 
for such an automaton has a hierarchical structure, 
then a small change in structure produces a small 
effect in proportion to the "position" of the 
change in the hierarchy. That is, a generator 
system may consist of autocatalytic or homeostatic 
systems, systems of these (which may or may not be 
autocatalytic or homeostatic), etc.; changes at 
the upper levels of the hierarchy will generally 
have a greater effect than those at lower levels. 

By making changes first at the highest level and 
then at progressively lower levels of the hier- 
archy, it should be possible to narrow down rather 
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quickly to any automaton in this category having 
some initially prescribed behavior. 

Changes in the generated structure result when 
relations on the generators are altered. The ef- 
fect of such alterations can perhaps be more 
clearly seen under the following interpretation. 

The generation of a particular automaton can be 
looked at as if all possible generation processes 
are going on simultaneously but at different rates. 
Some will be going very slowly (infinitely slowly 
in the limit) while others will be proceeding 
very rapidly. The way in which these rates are 
changed in order to change the generated automat- 
on will have important consequences with respect 
to the adaptiveness of the overall system (cf. 

A. L. Samuel's work on changing the weights of a 
"checker-move tree").-'-® 

As a final point it should be noted that the 
environment of an embedded automaton can be made 
as simple or complex as desired. Since adaptation 
must be defined in terms of the range of environ- 
ments in which the automaton is to be embedded , 
this is an important factor. It has already been 
noted that the environment may contain other 
sub-programs (precursors) or in fact other embed- 
ded automata. The latter case amounts to an im- 
plicit definition of the environment since only 
the initial state and internal rules of each of 
the embedded automata need be given. Contrast 
this with an explicit definition which would re- 
quire a point-by -point, time -step -by -time -step 
description of the state of the environment. If 
the precursors in the environment are relatively 
elaborate and sophisticated then the adaptation 
process will look similar to a heuristic learning 
system (cf. Newell -Shaw -Simon)? If precursors 
are absent or simply generators then the adaptat- 
ion process will look more like the processes 
considered by Friedberg. It seems likely that 
implicit definition of the environment will play 
an important part in the development of step four. 
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ON PROGRAMMING A HIGHLY PARALLEL MACHINE 
TO BE AN INTELLIGENT TECHNICIAN 

* 

Allen Newell 
The RAND Corporation 
Santa Monica, California 


Summary 


This paper speculates on how to program 
a machine that is suitable for microelectronic 
components to be an intelligent technician. 

The point of departure is from a class of ma- 
chines described by J. H. Holland in a con- 
current paper entitled, "On Iterative Circuit 
Computers Constructed of Microelectronic Com- 
ponents and Systems." These machines consist 
of a regular lattice of active modules, each 
possessing both processing and memory functions. 
The goal is a machine with the problem solving 
capabilities of a smart human technical assist- 
ant, and the volume processing capabilities 
normally associated with digital computers. 

This goal is chosen because it coincides with 
many current developments. After discussing 
the eventual capabilities desired and the most 
striking features of Holland’s machines, the 
speculation proceeds by considering the basic 
organization for information processing. This 
is followed by briefer treatments of the organ- 
ization for problem solving, supervision, in- 
terpretation and production. 


Introduction 

This is a speculative paper. I have been • 
invited to provide a link in a chain of reason- 
ing stretching from the impending advances in 
microelectronic components to their social con- 
sequences. I take as given a class of machines 
postulated by J. H. Holland in the preceding 
paper^, which I will henceforth call Holland 
machines. A Holland machine is very different 
from any current computer, reflecting both 
potential virtues and vices of microelectronic 
techniques. On balance, of course, the great 
increase in speed of action and number of com- 
ponents guarantees a great increase in proces- 
sing potential. I am to show how this potential 
can be realized. 

* All my thinking on intelligent processes and 
computer languages has occurred in the context 
of joint research with J. C. Shaw and H. A. 
Simon. For the speculative application of these 
ideas to Holland’s machines, I alone must take 
responsibility, of course. I am indebted to 
J. H. Holland, He not only provided the basic 
stuff out of which to fantasize, but he helped 
me to clarify the underlying concepts and sug- 
gested implications of the modes of organiz- 
ation I was trying to achieve. 


A Holland machine is so strange on first 
contact that the problem seems to be to regain 
the programming facility attained on current 
machines in a decade of effort. But magnitude 
increases in basic capacity should yield equiva- 
lent increases in delivered power. The prospect 
of working hard to reestablish what already 
exists is not inviting. Hence, my task includes 
providing some vision commensurate with the new 
capacities. 

The givens and the desired are yet 
further apart. A Holland machine is an abstract 
automaton, and I am free-that is, forced — to 
specify the exact version that will suit my 
needs. The boundary between the logical designer 
and the programmer has shifted so that by any 
normal job description I play both roles. The 
difference lies in the components available to 
me. I have not AND’s and OR’s, but "semi- 
computers," out of which to construct a machine. 

The Vision 

An Intelligent Technician 

Of the myriad things that will occur 
with the continued development of information 
processing machines, I will select just one 
for consideration — a machine that is an in- 
telligent processor of information. There is 
nothing very dramatic about such a machine, 
except perhaps the results that are achievable 
with it. It is difficult to differentiate this 
machine from an extremely compliant, fairly 
bright, human technical assistant, backed up by 
an impressive computing establishment. The user 
will converse with such a machine about his 
problem with the freedom of ordinary technical 
discourse. The language may not be English, 
but it might as well be for the freedom and 
flexibility it will afford. The machine will 
return answers with a rather breath-taking 
rapidity, and it is difficult to predict how 
rapidly a human-plus-machine can advance into 
a problem. Like all machines (and humans) it 
will have finite capacities. Ordinary intelli- 
gence could pose useful problems that require 
processing that the machine can do in a reason- 
able time. Ordinary muddleheadedness could so 
confuse the machine that it would not know what 
is desired. Ordinary knavery could fool it 
into doing the wrong thing. In short, the total 
product will depend on the joint intelligence 
and processing capacity of the user-plus-machine. 
A smart, articulate, well prepared man will 
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produce more with the machine than a dull, am- 
biguous, unprepared man. 

All our lives are spent learning to live 
with other intelligences, and relating to an 
intelligent machine will seem more familiar 
than strange. Since we want machines to help us 
solve problems, the more intelligent we are able 
to make it, the more unobtrusive it should 
be in providing this help. Contemplating the 
more extreme forms of this vision, there is 
little to describe about the machine except that 
it gives a great deal of help quickly and with 
very little pain. Not even this picture of 
productive harmony is safe, since we will soon 
accept all the benefits that flow from the new 
capability and will become aware of the con- 
straints and annoyances that mark the new 
boundary of limitations. Joining a reasonable 
level of intelligence with the volume processing 
characteristics we already associate with the 
large computers will produce large and dramatic 
consequences. But these require detailed con- 
sideration for each subject matter, a task ex- 
emplified by the last paper in the chain by C. W. 
Churchman. 2 

Why This Vision? 

I select this particular vision because I 
believe it lies along the main path of com- 
puter development quite independent of micro- 
electronic advances. 

The history of conputers is marked by a 
sequence of innovations, each eliminating some 
factor limiting the rate of information pro- 
cessing. Most of these limitations were the 
time it took humans to make decisions or take 
action. The addition of memory and control to 
a fast arithmetic device bypassed the human de- 
cision time in doing simple numerical procedures. 
This innovation produced the computer essentially 
in its present form. The excessive time that it 
takes for humans to code has given rise to the 
assemblers, algebraic compilers, list languages, 
and the like. The wastage from a half-second 
human tending a microsecond machine is forcing 
the development of supervisory routines and 
interrupt systems that will eventually eliminate 
the operator. 

The intelligent technician is simply an- 
other step in this development. The time for 
human invention and accurate specification of 
memory organization and elementary procedures 
in most programs is already too long. It will 
become intolerable with Holland machines made 
from microelectronic components. We need a 
'•problem-oriented machine" instead of a 
"machine-oriented machine" — and such a machine, 
truly conceived, is equivalent to an intelligent 
assistant. 

I do not consider the vision radical. In- 
deed, the programming and computing world is al- 
ready on its way to achieving it. 


The Requirements of Intelligent Action 

When applied to humans, intelligent 
action means action that achieves desired ends* 
There is no reason to modify this usage with 
respect to machines. An intelligent technician 
must be able to achieve desired ends. As nor- 
mally used the term is ambiguous, since the re- 
sources given to the intelligence are not stated. 
This ambiguity is essential, since high intel- 
ligence in the real world means the ability to 
surmount the "givens" if they get in the way of 
problem solution. It means the ability to 
achieve desired ends even when defined with 
surprisingly little information. Intelligence 
is our word to indicate the fact of successful 
performance against theoretically inadequate 
conditions. Intelligence is the resolution of 
ambiguity by means of an adequate theory. 

The notions expressed above trace capri- 
cious paths through the concept of intelligence. 
They can be summarized in four requirements for 
the behavior of the intelligent technician. 

Indefinite Resources . The intelligent 
technician must be able to solve problems. 

Jfare importantly, he must be able to work on 
any problem that can be stated to him. No 
limit can exist to the resources available for 
working on a given problem. These need not be 
all the methods available nor very good ones, 
but the machine should not quit because it has no 
more things to try. An intelligent human working 
on some hard differential equations will try to 
solve them analytically; then will turn to power 
series; then to a book on trigonometric series 
so he can apply these; then to a treatise on dif- 
ferential equations for some new clue (finding 
this via a reference in his standard text on the 
subject); then to numerical procedures. On and 
on it goes. If there is any fixed limit — any 
sharp restriction to a special class of methods — 
we will recognize soon enough that the intel- 
ligent technician cannot be left on his own, 
that it is really not intelligent enough. 

Indefinite Awareness . A distressing 
feature of current computers is their instability 
in the face of trivial errors, a feature often 
cited in making comparisons between brains and 
computers. To be intelligent is not to be 
trapped easily — not to require external help for 
little things. The machine itself must be aware 
of its own behavior and of the context in which 
its work is done. No sharp boundary can exist 
for which particular features the machine is 
aware , even though it cannot be aware of 
everything. This would soon reveal itself as a 
"quirk" or "blind spot" that limits the intel- 
ligence. 

Indefinite Language. The machine * s 
capabilities are mirrored accurately in the 
external language used between it and the human 



user. For example, if the language is limited 
to procedures, then it is not possible to ask 
the machine to solve problems, such as "Find x 
such that sin x - x « .5." The model of lan- 
guage required might be labeled the "Hide and 
Seek" model. Any information the transmitter 
can hide in the expressions being communicated, 
the receiver can find, providing the rules of 
hiding are given in the language. Any restric- 
tion to a fixed vocabulary, a fixed grammar, or 
a fixed technical area will be viewed immediately 
as a limitation in intelligence. 

Indefinite Accumulation . Anything that is 
reasonably intelligent learns from what it does. 
Indeed, problem solving involves learning in 
many ways — e.g. , learning about trigonometric 
series in the example above. Part of the stream 
of facts, procedures, clues, theories, and so on 
that pass through an intelligent machine will be 
accumulated for later use. Again, no definite 
boundaries can exist on what is selected or how 
it is filed and indexed. Limitations of this 
sort will reveal themselves as inappropriate 
repetitions, and will immediately be seen as a 
deficiency in intelligence. 

Perhaps I am grooming the machine to be a 
genius, rather than a mere technician. I do not 
think so. Educated men possess these character- 
istics to a remarkable degree. Limits will 
exist, or the machine would be smarter than 
most men. But these limits will be indefinite 
and shifting, and will express themselves in a 
global measure of the machine’s intelligence. 

It may seem that all the emphasis is on 
intelligence and none on the volume processing 
capacities that are also needed. On the con- 
trary, the machine is for high volume work. 

The intelligence forms the connective tissue 
that allows the machine to rapidly organize it- 
self to be highly repetitive and efficient. 

General Layout of the Machine 

Although I cannot provide a complete pic- 
ture, I will give some considerations about an 
organization for such a machine. These fall 
under the headings of information processing, 
problem solving, supervision, and interpretation 
and production. These headings are machine 
oriented and are not coordinate with the require- 
ments of intelligent action. The general layout 
is shown in Figure 1. 

An External Language is used between the 
machine and those things with which it communi- 
cates. This language is completely independent 
of the internal structure of the machine. Only 
incidentally will it refer to things arid pro- 
perties inside the machine. The expressions 
in the external language are taken in bodily 
and made available in the Input-output area . 

There is a single Processing area , struc- 
turally homogeneous, but divided into two 
functional areas: the Interpreter and the 


Factory . The Interpreter produces the processes 
indicated by the External Language expressions 
in the Input-output area. These action pro- 
desses are constructed in the Factory, and 
constitute the activities that the machine does 
in response to the external world. 

The processes in the Factory are in the 
Internal Language, as are the interpretation pro 
cesses in the Interpreter. In Holland machines 
it is difficult to distinguish programs from 
processes — expressions that say what to do from 
structures that do it. A language expression 
written in a set of modules may be sufficient 
to convert these modules into a process that 
behaves according to the expression. Hence, 
"internal language," "program" and "process" 
are used interchangeably. 

Besides the Input-output area, there 
are three stores for information. Each is 
structurally distinct because of different 
reading and writing requirements. The Program 
Store contains the large number of program 
forms required by the Interpreter and the 
Factory. The Associative Store contains 
entities with the properties normally associated 
with symbols. The Warehouse is a tertiary 
store that backs up all the other areas. It 
is a reminder that, even in a machine with 10 11 
component’s, access time must be traded for space 
in order to remember enough information. 

Holland Machines 

A Holland machine consists of a regular 
lattice of identical modules, as shown in 
Figure 2. Each module is directly connected 
with its immediate neighbors in the lattice. 
These direct physical connections can form 
paths between distant modules. A module’s 
capacity for paths is limited. Once formed, the 
paths give immediate access independent of 
length. Each module has both memory and pro- 
cessing functions, and may be active simultane- 
ously with and independently of other modules. 
The functions of a module are represented by 
settings of bit patterns, so that a module may 
take on any possible character by writing a new 
bit pattern into it. Operations change the 
state of a module as a function of the states of 
other modules connected to it by paths. These 
include operations for reading and writing in- 
formation into modules and for building up 
paths. The machine is basically synchronous. 

Within the bounds indicated almost any 
kind of a system is possible. There is freedom 
to specify the number of immediate neighbors, 
the kinds of information held by each module, 
the operations performed by each module, and the 
sizes of the paths. For a given amount of basic 
componentry the more complex the module the more 
components it will take per module and the fewer 
modules that will be available. The original 
papers should be consulted for more detail. 
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The following features summarize most of 
the advantages and problems of Holland machines: 

Indefinite Parallelism, The most striking 
feature of Holland machines is their active 
nature. Many processes can run in parallel, 
each of arbitrary composition. Our experience 
is almost entirely with couplet ely serial ma- 
chines. Machines with a small number of in- 
dependent processors are just coming into being. 
Although a telephone system is like a big 
parallel coirputer in many ways, it acconplishes 
unsophisticated functions compared to those 
needed for general computation. There is little 
work in the literature on programming systems 
of this sort. Besides Holland’s papers, work 
by linger^ and Selfridge^ is; relevant, although 
they both focus on pattern recognition. 

Industrial organizations are examples of 
highly parallel systems, and face many problems 
similar to this one. We can expect to be in- 
volved in the same crucial issues of central- 
ization-decentralization, coordination, and 
division of labor. We can also expect the 
growing rationalization of management, typified 
by operations research, to be a prime source of 
clues for programming these systems. 

Local Action . The modules of Holland 
machines work by local contact along paths. In- 
formation is designated by pointing to it, 
rather than by symbols that refer to it. A major 
gain recently made in programming was finally to 
create an entity that behaves in many ways like 
a linguistic symbol (the address with a list of 
associated information). No simple correspondent 
to this exists in Holland machines. 

Iterative Structure . The modules in a 
Holland machine are identical, although different 
Holland machines can be used for the gross areas 
of Figure 1. This iterative character, besides 
being suitable microelectronic production tech- 
nique, provides a "space” already rich in pos- 
sibilities in which structures can be created 
at will by information transfer operations rather 
than actual construction operations (in the car- 
penter’s sense of the word). However, this 
implies that most modules will have only a small 
fraction of their componentry utilized. 

Fixed Network of Connections . The multi- 
dimensional connections between modules seem at 
first glance to be a blessing. The virtue fades 
as soon as arbitrary structures must be processed. 
The information is invariably incommensurate with 
the fixed network. "Dead ends," in which all the 
paths to a module are occupied and no way exists 
to gain access to the module, are continually a 
problem. Sinple data organizations must be used, 
even though they prohibit clever ways of encoding 
particular information. 


Information Pro cessing 

The first level of organization of the 
machine is the information processing level. 

It contains means for building up structures of 
information and for forming processes to operate 
on these structures. It involves specifying the 
Holland machine for the Processing area (the 
Interpreter and the Factory) and indicating how 
the problems of memory and program organization 
will be solved. In the following I draw heavily 
on the programming experience with the problem 
solving programs described in the next section 
and on the list language s^ constructed to aid 
in programming them. Properties of the Proces- 
sing area will be accumulated; then a Holland 
machine for them will be sketched. 

Units and a Principle of Homogeneity 

In current machines and coding a large 
discontinuity exists between the machine level 
and all structures built up from this level. A 
subroutine is not like an instruction; a double 
precision number is not like a number contained 
in one word. The temptation to create Holland 
machines similarly is strong — to create neat, 
powerful, elementary operations for modules and 
then to build up everything from these in 
structures that look very different from modules. 
There would be a preferred size of channel (the 
one between modules), a preferred size of in- 
formation (the word in the module), and a pre- 
ferred set of operations (the operation code of 
the module). 

Contrariwise, the Processing area satis- 
fies a Principle of Homogeneity, according to 
which a module and a higher unit cannot be dis- 
tinguished by any of the conventions for dealing 
with them. The basic structure of the Processing 
area is the following: 

1. The Processing area consists of a 
stack of planar fields, each field containing 
spatial units. These are rectangular in shape 
and vary in size. Their boundaries do not cross 
each other, but units may exist inside other 
units. 

2. The units are connected by paths. 

These run in horizontal and vertical segments 
and may be of any width. Two paths may cross 
each other at right angles, but otherwise two 
paths may not occupy the same space. 

. 3. Bach unit has registers that hold 
information in bit patterns. There can be units 
with any number of registers, and registers with 
any number of bits. 

4. Each unit can accomplish an inform- 
ation process involving reception from some 
paths, transmission along others, and changes 
in internal registers. 



These properties allow units to be built up 
from networks of other units in order to accom- 
plish more complex processes. Similarly, units 
may be analysable into subunits connected by 
paths. If analysis proceeds far enough, units 
are reached that are not further analysable. 

These may be either modules on aggregates of 
modules that, for reasons of speed and space, 
have been arranged internally in ways that do not 
correspond to all the conventions of units. 

It is now unimportant where the modular 
level is, and what operations and registers 
modules contain. I am now free to specify flow 
diagrams of units with arbitrary functions con- 
nected by paths of arbitrary information content. 
No assumptions need be made about the module op- 
erations, except completeness. For the machine, 
the problem of organizing itself has been much 
simplified. 

Spatial Operations 

The machine needs ways to assemble and 
manipulate units and paths. These should also 
be homogeneous and should not require extensive 
knowledge about the distribution of units in 
the field. The following additional properties 
seem appropriate: 

5. Units can be moved in any of the six 
directions (vertical movement between layers is 
needed). The entire unit moves at once with a 
velocity of one module per basic cycle (vari- 
able or faster speeds are complicated). 

6. Movement takes place freely into 
space not occupied by paths or other units. A 
moving unit coming into contact with a station- 
ary unit or path sets the new unit in motion in 
the same direction. Two colliding units stop. 

A moving unit stops upon contact with the in- 
side boundary of a unit. 

7. Paths remain connected to moving units, 
growing and contracting as required. 

8. Units can expand and contract. In ex- 
pansion the same conventions hold as for a 
moving unit. It sets other units in motion and 
stops when it contacts the inside of a boundary. 
In contraction, the inward moving boundary sets 
the inside units in motion, and all motion comes 
to a halt when the subunits are jammed together. 

9. Units can be copied. This always oc- 
curs in the vertical plane, as shown in Figure 
3 (a direct horizontal copy is difficult). The 
figure shows a horizontal copy obtained by 
copying-up, moving-over, and moving-down. 

These capabilities provide convenient ways 
to manipulate structures. Movement is control- 
led by always working inside a higher unit as 
a sort of corral. Rearrangements occur auto- 
matically, since all units tend to move out of 
the way. 


Information Structures 

The more complex the programs, the more 
irregular and dynamic are the structures of 
information built up of units and paths. The 
machine must be prepared for network-like 
affairs, as shown in Figure 4. The left side 
shows a simple tree; the right side indicates 
that matters are not always so simple. To put 
such structures (and their more elaborate 
cousins) into the rectangular grid of the Pro- 
cessing area, a standard outline form is used 
(Figure 5). Each unit is allocated a horizon- 
tal band. Subunits are indented to the right 
and put immediately below their superunits. 

The connecting paths use the space made avail- 
able to the left. As the tree grows all trie 
units are put in downward motion. Each unit 
can also grow independently to the right in 
its band. The right hand limit is given by 
the boundary of the superunit that contains 
the entire structure. 

Splitting Operations . The simple outline 
is insufficient for general networks, as an 
attempt to add the paths for the right side 
of Figure 4 shows. Already paths can cross 
paths, but they must also be able to cross 
units. Splitting the unit is one solution: 

10. A unit can be split into two parts, 
either vertically or horizontally, and these 
two parts may be moved indefinitely far apart. 
The space between the parts contains only 
paths. Elongations of the internal paths of 
the unit run across the split, so that the 
unit remains unchanged as far as information 
flows are concerned. Other paths can run 
along the split, and so traverse a unit. 

The outline conventions and the split- 
ting operation allow the machine to put any 
kind of information structure into the Pro- 
cessing area. This stylized form is often 
less contact than other forms, but it avoids 
solving many problems in memory organization. 

Definitional Control 

No sharp distinction exists between in- 
formational units and processing units. Mast 
data is contained in units that actively trans- 
mit the data over paths, rather than waiting 
passively to be read. Both the Interpreter 
and the Factory are systems of many inter- 
connected, simultaneously active units, and 
techniques must be provided for control and co- 
ordination. 

Experience at the programming level is 
exclusively with sequential control. Se- 
quencing information is given independent, of 
the content of the processes, usually by the 
order of instructions. A form of sequential 
control could be adapted to the present ma- 
chine. Instead, a more radical, fully asyn- 
chronous procedure is used. This is called 
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definitional control , since it rests on the fact 
that procedures work from the defined towards 
the undefined. Although similar to existing 
asynchronous techniques, I am not familiar with 
any discussion of it for general procedures. 
However, some work by Steward” is relevant. 

Any procedure can be put into the form of 
a hierarchy of subprocedures: The lowest pro- 
cedures work on the input data; the next pro- 
cedures work on the outputs from these; the next 
higher ones work on the outputs of these; and 
so on. Figure 7 shows the computation of Ke“^ K 
in this form, as it would occur in the Factory. 
Iterative, recursive, and conditional processes 
can be analysed analogously. 

Assume that each process is sensitive to 
whether its inputs are available and to whether 
its output can be accepted. As long as K re- 
mains undefined no computation takes place. As 
soon as K becomes defined, say by some other 
process feeding a number into it, then the first 
multiplier can operate, since both of its inputs 
are defined. Once its computation is finished 
and it transmits the product, the exponential 
process can operate; once this is finished, the 
final multiplication operates. Finally the re- 
sult is printed. Each process operates as soon 
as it can — that is, when the situation is de- 
fined for it. Until that time, it simply waits. 

Suppose a process at the bottom was feeding 
numbers into K at a faster rate than the printer 
could operate. Until the printer accepts a 
result, the top multiplier does not accept the 
inputs from K and the exponential; the exponen- 
tial does not accept the product' from the lower 
multiplier; and it, in turn, does not accept 
the next value of K. The computation auto- 
matically becomes paced by the printer. 

This form of control partially rational- 
izes the construction of procedures. No in- 
dependent sequencing problem must be solved. 

A process that is put together correctly accord- 
ing to the flow of information is ready to go, 
and will sequence itself automatically. Since 
the machine constructs its own programs, this 
seems a desirable property. 

To obtain definitional control, the follow- 
ing are needed: 

11. Every register is either defined or 
undefined; this can be detected on any path 
leading into or out of the register. (This is 
simple enough unless there are multiple trans- 
mitters and receivers, in which case things 
get complicated. ) No transmit operation will 
be executed into a defined register (it's al- 
ready occupied), and no read operation will be 
executed from an undefined register. 

12. Each unit is either active or in- 
active. If a unit becomes active all its sub- 
units become simultaneously active also. If it 
is inactive it will not perform any processing 


(spatial operations excepted), and all of its 
registers will be undefined. If it is active, 
it will perform its processes as soon as all 
paths involved are admissible (inputs defined 
and output receivers undefined). 

Modules and Mass Operations 

I must at least indicate how the numerous 
asserted properties might be achieved in a 
Holland machine. The ideas seem independently 
interesting because they involve mass operations, 
which affect all modules in a region simultane- 
ously and identically. Little experience exists 
with such operations^; but they seem worthwhile 
exploring. 

Each module consists of two parts: a 
regular part and a substrate. The regular part 
involves operation codes and storage registers 
from which processing and memory functions will 
be synthesized. The substrate takes care of 
defining units with their spatial and activity 
properties. The regular part satisfies the 
homogeneity principle. The substrate achieves 
the homogeneity principle for aggregates of 
modules and has no counterpart at the unit level. 
The regular part need not he defined because of 
the homogeneity principle, but the substrate 
needs discussion. 

The substrate consists of a state for 
each module, some operations for changing state, 
and some paths interconnecting modules. Every 
module is either: 

1. occupied or unoccupied; 

2. active or inactive; 

3. repetitive or nonrepetitive ; 

4. stationary or in motion in one of 
six directions (four horizontal and 
two vertical); 

5. Open or closed, independently, in 
each of the four horizontal direc- 
tions to the transmission of influence. 

All the modules in a plane are inter- 
connected so that they must all change state 
simultaneously. If one module goes from in- 
active to active, all modules go to the active 
state. This network of influence, which ex- 
tends from any module to all other modules, can 
be broken by modules that are closed. If some 
modules are surrounded by a boundary of modules, 
all closed with respect to transmission outward, 
then changes inside affect everything inside 
but affect nothing outside the module. A unit 
is : a rectangular set of modules with a boundary 
that blocks influence from the inside out but 
transmits influence from the outside in. This 
last condition makes possible hierarchies of 
units. Anything that occurs in the larger unit 
affects the subunits (like moves, executes, 
stops, etc.), but not vice versa (a subunit can 
move about in the unit without making the unit 
move). 


A module in motion copies its content into 
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the next module in the given direction. This 
leaves the motion state unchanged, and motion 
continues until something turns one of the 
modules to stationary. As soon as this happens, 
all modules within the unit instantly become 
stationary and the unit stops moving. 

A unit is made active by changing any of 
its modules from inactive to active. All the 
modules are simultaneously made active, and the 
unit immediately begins to "push" to execute its 
process, since those parts which can operate go 
into immediate operation. Two states of oper- 
ation are needed, repetitive and non-repet itive. 

A repetitive module remains active even though 
it has just executed its operation; it imme- 
diately tries again. A non-repetitive module 
sets itself inactive as soon as the operation is 
accomplished. This change immediately affects 
the entire smallest unit that contains the 
nodule, but it does not affect the modules out- 
side the unit. Incidentally, this shows that 
influence must depend on state changes and not 
states, since otherwise the existence of active 
modules outside would immediately reactivate a 
module that had just turned itself off. 

Although this development is incomplete, 
the outlines are clear, including the power of 
the mass operations. 

Symbols and the Associative Store 

Both human language behavior and the pro- 
gramming of complex systems point to the need 
for a stock of symbols for general information 
processing. Symbols are entities with the follow- 
ing properties: 

1. It is always possible to obtain easily 
a new symbol, not otherwise being used (within 
the limits of the total stock). 

2. It is possible to produce indefinitely 
many occurrences of a symbol (symbol tokens) and 
these can be moved around and placed in struc- 
tures. 

3. Given the occurrence of two symbols, 
it is possible to determine whether they are 

the occurrences of the same or different symbols. 

4. A symbol is a locus of associations. 

This means the following operations can always 
be performed: 

a. An associative link can be formed 
from a symbol S to a symbol S', and the link 
labeled with the symbol S". Only one link with 
a given name, S", exists for a given symbol, S. 


* I reiterate that this formulation, and several 
others throughout the paper, come from joint 
efforts with J. C. Shaw and H. A. Simon. 


b. The symbol associated to the symbol 
S by the link labeled S» can be found. This i3 
the inverse operation to the one above, and forms 
the sole significance of the "associative link." 
No association with label S" needs to exist, but 
if it does it is unique. 

c. The link labeled S" for the symbol S 
can be destroyed. 

This set of properties ties the concept 
of a symbol to the concept of an associative 
memory. Likewise, these properties give one 
definition of an associative memory, which is 
equivalent to being able to form and distin- 
guish arbitrary single-valued functions of one 
discrete variable. It implies that an ade- 
quate concept of symbol and reference can be 
formed |rom the concept of a locus of associ- 
ations. Without arguing any of these points, 
this formulation is taken as the requirement 
for giving a machine symbols. 

Although the Processing area contains an 
elaborate system for processing information, 
it provides nothing that resembles a symbol. 

One unit designates another, not by having a 
symbol that names it, but by directly pointing 
to it. Although the bit patterns can be copied, 
transferred and compared, they do not allow 
associations. They are used entirely as objects 
and never refer to anything. 

A separate Holland machine, the Associ- 
ative Store, provides symbols. The same dif- 
ficulties in trying to map a free structure 
into a rectangular grid arise here. To each 
symbol there corresponds a structure that 
encodes an indefinite number of associative 
links, so that the total store is a completely 
arbitrary, dynamically changing network. 

Figure 8 shows a scheme for the Associ- 
ative Store. The Holland machine for the Store 
consists of planes of modules. Each symbol has 
two adjacent lines of modules in the Store for 
its associations and an unique bit pattern for 
its token. Each association takes two modules, 
one in the lower line and one right above it in 
the upper line. The lower module is concerned 
with the symbol that labels the link, and the 
upper module is concerned with the symbol to 
which the link is made. 

Each module has a single register that 
holds a token. The horizontal data paths, 
which transmit tokens, are open, so that all 
the modules in a line see the same token simul- 
taneously. The horizontal command paths are 


# This is an old notion with psychologists. 
However, this associative system is considerably 
more powerful than any proposed for organisms 
by psychologists. The essential difference is 
that here the links themselves can be labeled with 
symbols and hence can take associations. 
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directional to the right, so that a command 
issued by a module is seen simultaneously by all 
modules to the right but by none to the left. 

The vertical command paths connect each module 
to its immediate neighbor above. 

Each module is either in the "defined” or 
"undefined" state. Each can record the token 
on the data path in the register; transmit the 
token in the register onto the data path; compare 
the token in the register with the token on the 
data path; and receive and transmit commands. 
Three specific patterns of these actions are re- 
quired for the associative operations. (The 
•link token' means the token of the symbol that 
labels the link; the 'symbol token' means the 
token of the symbol to which linkage is made.) 

1. Form an associative link ; The symbol 
token is put on the upper data path and the link 
token is put on the lower data path. Each link 
module in the defined state compares the link 
token with the one in its register, and if they 
are equal commands the symbol module above it 

to record the symbol token in its register. Each 
link module in the undefined state records the 
link token in its register, commands the symbol 
module above it to record the symbol token in 
its register, and sets itself to be defined. 
Further, all modules that took action command 
all modules to the right that have this link 
token registered to set themselves undefined. 

This order overrides any action to set a module 
defined. The net result is that one and only 
one module-pair records the association. Many 
modules may have the information in their regis- 
ters, but only the leftmost one will be in the 
defined state. This assures that there cannot 
be two links with the same label. 

2. Find an associated symbol . The link 
token is put on the lower data path. Each link 
module in the defined state compares this with 
the token in its register. If they are the same, 
it commands the symbol module above it to trans- 
mit the token in its register onto its data path 
as the symbol token. By the nature of the system 
either no module transmits or exactly one module 
transmits. 

3. Destroy a link . The link token is put 
on the lower data path. Each link module in the 
defined state compares this with the token in 
its register. If they are the same, it sets 
itself to the undefined state. 

The scheme is still incomplete in essential 
respects. Most important is the connection be- 
tween the token, which is a bit pattern, and the 
two lines of the associative store that corres- 
pond to its symbol. The mechanism described 
simply delivers and remembers bit patterns in 
response to bit patterns transmitted on special 
data paths. Some way must exist for units in 
the processing area to take a token and con- 
struct a path for themselves to the data paths 
in the associative store. Once they connect 


they must be able to command the associative 
operations and receive bit patterns back along 
the path in return. 

This problem is strikingly similar to 
that of a telephone system. Connections must 
be made from a set of subscribers, the units, 
that are widely scattered throughout the pro- 
cessing area. The connections are only tempor- 
ary; once formed they persist for variable 
periods. Finally, only a few subscribers want 
access to the store at any one time. The token 
bit pattern, then, is a "telephone number" and 
the units "dial" the associative store. 


Problem Solving 

The organization of the machine at the 
information processing level is now clear 
enough to allow a brief consideration of higher 
levels of organization. Most important is an 
ability to problem solve. 

Combinatorial Problems 

Recent work on heuristic programs pro- 
vides considerable information on the processes 
involved in problem solving. Programs have been 
written for tasks which are sufficiently complex 
and difficult to require intelligent action by 
humans. These tasks include theorem proving in 
elementary domains, chess and checker playing, 
musical composition, and some management science 
problems'. All these programs formulate the 
problem in a common way and attempt to solve the 
problem by a common approach. 

The problems all involve a set of objects 
and a set of operators for producing new objects 
from old objects. The goal is to find a se- 
quence of operators that produces an object with 
certain desirable properties, given some in- 
itial objects. For example, in chess the ob- 
jects are positions; the operators are legal 
moves; and the goal is to produce a sequence 
of moves that will result in a winning posi- 
tion. In theorem proving, the objects are 
theorems; the operators are rules of inference; 
and the goal is to produce a sequence of in- 
ferences that will result in the theorem to be 
proved. 

These problems are combinatorial in 
nature. They involve the selection of a se- 
quence with certain properties out of the set of 
all possible sequences. The possible sequences 
can be represented as an expanding tree: given 
an initial object, application of the operators 
in all possible ways yields a set of new ob- 
jects. Applying the operators in all possible 
ways to each of these objects yields another 
(much larger) set of objects two steps removed 
from the starting point. This can be repeated 
to generate the tree to any depth by taking all 
of the objects obtained at depth D and applying 
the operators in all ways to them to get the 
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objects at depth D+l. If approximately H‘ ap- 
plications are possible for each object, there 
are ff branches at each node, and of the order 
of Ir objects are generated at depth D. Thus 
the number of paths in the tree goes up exponen- 
tially with depth. 

If the solution lies at depth D, then of 
the order of paths must be* examined to dis- 
cover the solution. It is easy to show that 
immense numbers of paths occur for any real 
problem. For example, chess has a B of about 
30 and a D of about 80, yielding about 10 120 
paths. Since the entire tree cannot be ex- 
plored, it is necessary to be sophisticated in 
searching — to look only where the answer is. In 
hard problems no single item of information tells 
exactly where the desired object is, and many 
different bits of information, each of low cali- 
bre, must be used to restrict the search to a 
reasonable number of branches. These odd bits 
of information and the ways of organizing them 
are called heuristics. An example from geometry 
is "Don't try to prove two angles equal unless 
they appear about equal in a well drawn diagram." 
This heuristic reduces the average number of 
branches at each node by rejecting some that 
would otherwise have to be explored. The effect 
of heuristics is to reduce the parameters, B 
and D, in the search formula, B^, leaving the 
form of the problem unchanged — search in an ex- 
ponentially expanding space. 

Problem solving by heuristic search is not 
the universal solvent that dissolves all parti- 
cular methods. It is the last ditch defense 
when no special methods are known. The machine 
has many special methods, from analytic dif- 
ferentiation to Newton's method, and uses them 
wherever appropriate. It also needs some model 
for all the other problems that arise. Heuris- 
tic search is the best available. 

Parallel Search 

The problem is to organize the machine for 
problem solving. In current serial machines a 
single program, the problem solver, conducts the 
search. It carries with it an accumulation of 
information about the maze and its activities 
in it, and combines this with its more general 
heuristics to decide which branches to explore. 

If on the average it takes C time units per 
node, then the search time is of the order of CB®. 

Being parallel, the machine is not limited 
to a single problem solving process. With P 
problem solvers, the time of solution is of the 
order of (C/P)B^. This speeds up the process 
by a factor of P, but does not affect the ex- 
ponential, which governs the growth of the search 
tree. Coordination problems also exist, since 
no one problem solver accumulates all the inform- 
ation. Each must contain additional processes 
for transmitting its information and analysing 
the information received from the other problem 
solvers. 


Parallelism can be pushed further. Each 
problem solver can have the task of not only 
creating the B branch points, but of making B 
copies of itself and of setting one copy to work 
independently on each branch. Now every branch 
point is active, and even if it takes an ad- 
ditional K time units to produce each new prob- 
lem solver, the total search time to depth D 
is of the order of (C+BK)D. If this equation 
were only true, all the world's problems could 
be solved in a day I It claims that the search 
time, instead of being exponential with depth, 
is linear with depth. Thus, at a microsecond 
per chess position, it would take only a few 
milliseconds to compute the perfect game. The 
absurdity of this is manifest, but it is in- 
structive to examine why it cannot hold. 

Parallel computation trades space for 
time. The space required to hold the data of a 
problem tree also increases exponentially with 
time (although serial strategies exist that 
keep the space proportional to D or BD). In the 
fully parallel case space must be found to put 
the exponentially increasing number of problem 
solvers. Since space becomes available at the 
rate at which units can move apart, the ulti- 
mate determiner of the growth of the tree is 
the velocity of movement. Since this is al- 
ways limited (to one module per cycle for this 
machine), the time to search the tree remains 
exponentially related to depth.* This is simply 
the Malthusian problem for programs, in which a 
population of programs reproduces itself geo- 
metrically, while its sustenance, space, grows 
only linearly. 

Parallel search is still an effective 
strategy. By suitably spacing the initial points, 
extremely rapid exploration can be had of the 
first part of the tree. The devices described 
earlier for information processing make the 
mechanization of these procedures straightfor- 
ward. Independent problem trees can exist simul- 
taneously in the processing area, each encased 
in an expanding boundary. The entire set of 
growing units moves around in the yet larger 
bounded area that contains them all, utilizing the 
total space in a reasonable, if not optimal, 
fashion. 


Supervision 


Earlier a principle of awareness was 
stated by which the machine should be able to 
detect the consequences of its actions and have 
some ability to correct, modify, or prevent them. 
This covers reliable computation as well as more 
subtle things, such as inconsistencies in the 
user's data. 


* Even if space expands in N dimensions, the 
volume cannot grow fast enough to keep up with an 
exponential demand. However, added dimensions 


help. 

sion. 


If N dimensions are available for expan- 
the search time becomes like B*v . 
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An approach to this problem is to provide 
reliable containment of consequences and super- 
vision by independent processes that are suf- 
ficiently intelligent to handle the errors that 
occur. The unit boundaries, impervious to out- 
ward flowing influence but transparent to inward 
flowing influence, are naturally suited to con- 
tain consequences. The supervisory processes 
are units that sit outside the units they super- 
vise, checking and correcting certain classes 
of errors. Not all errors can be checked, since 
only a finite amount of effort can be spent on 
supervision, and there are many errors the ma- 
chine cannot possibly rectify. 

Two aspects of the problem can profitably 
be discussed further at this level. First, who 
shall guard the guardians? The simplest answer 
is to have another supervisor, as shown in 
Figure 9. Each lower system is contained in a 
walled cell, and the supervisor outside has paths 
into it for monitoring it. The problem is to 
avoid the implied infinite regress. One solu- 
tion is to adhere to the rule that any unit 
that acts outside itself must be supervised, but 
any unit that only takes in information need not 
be supervised. In Figure 9, assume the action 
program is for a user and will result in an ex- 
ternal response. It must be supervised, so 
Supervisor #1 must exist. As long as Supervisor 
#1 takes no action on the action program, it 
need not be supervised. The moment Supervisor # 1 
goes into action, Supervisor #2 must come into 
existence, and so on through an upward recur- 
sion of supervisory routines. As units complete 
their tasks the supervisory units disappear again. 
If supervisors are constructed to act rarely, 
say only when errors occur, then the height of 
the recursion is governed by the successive 
powers of the error probability and is kept 
under good control. 

The second aspect is the nature of general 
computation. Current programs normally consist 
of a single procedure, all of which must be 
gone through to produce the final result. Con- 
trariwise, most procedures carried out by people 
are done in the context of a tree of alternatives, 
where at each step, although one action is taken, 
others are possible to achieve the same end. 

Thus, if the computed value of the cosine is 
wrong, it can be recomputed; if still wrong, it 
can be looked up in a table; if still wrong, the 
look-up procedure can be checked; and so on. 

The strategy is not to correct errors directly, 
but to bypass them and produce the result a 
different way. All computations are to be carried 
out as a problem solving search to discover the 

* An alternative approach, which has been much 
pursued, is to achieve the reliability at the 
level of the smallest component by such means as 
error-correcting codes and multiplexing. This 
approach tries to solve the problem without re- 
course to the larger context in which the com- 
putation occurs. 


solution. The nodes in the search tree are stages 
in the computation; the branches are the pos- 
sible computational actions. It makes little 
difference whether a path to the answer is re- 
jected because it is "a wrong computational 
step" or because it is "a right computational 
step with an error." 


Interpretation and Production 

There are two conflicting requirements 
for this machine. For volume processing of 
data, its programs should be as efficient as 
possible. But, since these programs come to it 
from outside, they should be as easily communi- 
cated as possible. The external language and 
the internal language are conpletely divorced 
from ea’ch other in order to deal with the con- 
flict. The external language is communication 
oriented; the internal language is production 
oriented. The intelligence of the machine 
mediates between them. 

The general operation of the machine is 
as follows: The Input-output area contains ex- 
pressions in the external language, generated 
by both outside users and internal processes. 
The expressions are analysed by a process in 
the Interpretive area, which builds up a pro- 
cess in the Factory that will take the action 
appropriate to them. The Program Store pro- 
vides the parts from which these action pro- 
cesses are constructed, ranging from small 
parts, like multiplication, to entire programs, 
like matrix inversion. These parts are forms; 
they are identical to the final process except 
for certain variable units and paths, which 
must be specified to make an actual process. 
They are moved out of the Program Store along 
unoccupied planes and copies of them deposited 
in the Factory. Then the various unspecified 
units and paths are identified and brought 
together. For this purpose, the variable parts 
of the forms carry descriptions of their func- 
tion. For example, the form for the process 
in Figure 7 might be xe 3 * and would look ident- 
ical to Figure 7 except for an x and an a in 
place of the K and -2, and an expression xe 3 *, 
in place of the print. 

Once a complete process is assembled, and 
the interpreter decides the interpretation is 
correct, it is executed. The interpreter re- 
mains intact during execution to handle dif- 
ficulties, to remove the action process at the 
end, and to decide if anything should be sal- 
vaged for later use. 

The behavior of the machine is a collec- 
tion of such efforts, each composed of an 
action process governed by expressions in the 
external language and mediated by an inter- 
pretive process. Some are built up in response 
to simple external requests and are concluded 
almost immediately and some involve long 
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production runs. Others constitute strategies of 
internal operation, such as the allocation of 
effort between production and improvement. The 
rate at which the machine processes information 
is determined by whatever limit is reached first: 
production time. Factory space, input-output 
equipment, interpretive time, and so on. 

Productive Efficiency 

little study has been given to the theory 
of efficient computation. However, certain 
general features of the machine permit efficient 
programming. 

One source of inefficiency is excessive in- 
terpretation. This generally occurs in repeti- 
tive processing, where interpretation of the 
same procedure occurs over and over again. It 
is avoided by using the first interpretation to 
set up an efficient procedure, so that subse- 
quent interpretation isn't needed. The division 
of labor between the Factory and the Interpreter 
permits this efficiency. A more subtle version 
of interpretive inefficiency is the use of sym- 
bols and other indirect forms of designation. 

The use of direct paths in the Factory permits 
the elimination of most of these costs. 

The machine operates through the use of 
standard building blocks, which it puts together 
in various standardized ways to construct yet 
bigger building blocks. Often a process so 
built up can be made more efficient, both space- 
wise and timewise, by reconstructing it out of 
more microscopic processing units. One of the 
continual activities of the machine is to invent 
more efficient units for processes that are im- 
portant to it. Independent of immediate demands, 
it takes program forms from the Program Store, 
attempts to improve them, and returns them to 
the Store. The discrete nature of the machine 
allows the problem of improving a program to 
be phrased in terms similar to the problem of 
playing chess or proving theorems. 

Interpretation as Problem Solving 

The only acceptable dictum for designing 
the external language is that it cover the full 
range of expressive devices. It is easy to 
state some of these. We want the ability to sub- 
stitute expressions for terms; to add new terms 
and abbreviations; to express new language con- 
ventions; to make ambiguous and vague statements; 
to state problems as well as procedures; to mix 
syntax and semantics; and so on. Each of these 
is needed, not only for communication, but be- 
cause the machine uses this language for all 
its internal control and problem solving. Con- 
sider the possibility for ambiguity, which may 
seem an odd property to desire. If the machine 
cannot state an ambiguous notion, it can never 
get started on a series of successive approxi- 
mations to achieve an exact notion. And if the 
man must be unambiguous in communicating to the 
machine, he can never get it to help him formu- 
late a difficult computation. 


A discussion of the mechanisms required 
to interpret these features is beyond the bounds 
of this paper. However, one general mechanism 
is relevant to all of them — the treatment of 
interpretation as problem solving. The inter- 
preter in the machine for a given expression is a 
branching tree of processes. Each node of the 
tree represents a set of hypotheses about the 
meaning of the expression (more precisely, about 
the implication of the expression for current 
action). Each branch represents an additional 
hypothesis about the meaning, so that different 
branches at a node represent alternative inter- 
pretations of the expression. The processes at 
the nodes carry out the analysis of the expres- 
sion. Each is specialized to the hypotheses at 
the node, and operationally represents these 
hypotheses. These processes detect information 
that would confirm or reject the hypotheses, and 
if the latter, terminate interpretation along 
their branch. They find new information that 
leads to additional hypotheses, and create pro- 
cesses to continue the interpretation within 
these more restrictive bounds. They also take 
direct action in building up the action process 
in the Factory. 

Viewed this way, interpretation is the 
process of discovering by a series of inter- 
pretive acts a final action process that is 
consistent with all the information extracted 
from the expression. The search goes on in 
parallel with all alternative interpretations 
being carried forward. If the expression is 
clear, then one line emerges cleanly and im- 
mediately. If the expression is full of little 
errors, then many short branches occur at each 
node, all but one of which prove false. If the 
expression is ambiguous, more than one line of 
interpretation continues through to a complete 
action program, and information from more dis- 
tant sources must be sought. 

To give one simple example of this pro- 
cess, consider the interpretation of the re- 
quest "Compte Ke“ 2 ^ for K = 1 and 2." Assuming 
that nothing is known about the expression, 
interpretation starts with a standard process. 
"Compte" is recognized as not corresponding to 
a word, and the interpretive hypothesis is taken 
that the user meant "Compute." On this basis 
a new interpretive process is set up that assumes 
some formula for numerical computation will 
follow. When Ke“ 2 ^ is found this is confirmed. 
This action is dependent on the first. If the 
initial word had been "Differentiate," then a 
form for numerical computation would be wrong. 
Also, the hypotheses needn't have been confirmed. 
If the original expression had been "Compte is 
to mean the same as Compute," then a numerical 
computation was not desired and an alternative 
hypothesis is needed, say that "Compte" is being 
used as a name for itself. Getting the form con- 
stitutes a second interpretive act and another 
interpretive process is created which matches 
the form to the expression. This results in ad- 
ditional interpretive acts as x is identified with 
K and a is identified with -2. The final inter- 



pretive act occurs when the print process is put 
in. Since no indication is given for’ what the 
machine is to do with the computation, this is 
a hypothesis that the user wants the information 
given to him immediately. Its confirmation must 
wait until after the action process has been ex- 
ecuted, and the user reacts to the result. 

Although this mechanism does not solve the 
problems of interpreting the devices mentioned 
earlier, it seems to be a necessary feature of an 
intelligent interpreter. 


last Thoughts 

Speculation stops here. The problems of 
making an intelligent technician from a Holland 
machine form an expanding tree which could be 
explored both broader and deeper. 

Each of the topics touched on in the paper 
was left incomplete: the nature of the modules 
that can achieve the properties stipulated; the 
nature of the telephone exchange; the mechanisms 
for interpreting the external language; and so 
on. Major aspects were not even mentioned: the 
problem of going from an object to its symbol; 
the problem of organizing the machine's knowledge; 
the problem of context; and so on. But at this 
stage of development, with the microelectronic 
techniques still a little ways distant, it is ap- 
propriate to end with a sense of incompleteness 
and need for further exploration. 
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ON A POTENTIAL CUSTOMER FOR AN INTELLIGENT TECHNICIAN 


C. West Churchman 
University of California 
Berkeley 


The manner in which the other panel members 
have answered the questions posed to them permits 
me to consider my own without constraints. I've 
asked myself, therefore, to name one class of po- 
tential customers of an intelligent technician- 
computer , whose problems are clearly important, 
clearly in need of computational aids, clearly 
demanding the kind of intelligence which Newell 
describes. 

Now one ready reply to such a query is in 
the form of "more and better"; one can scan ex- 
isting customers and reflect on the ways in which 
they can more fully and more adequately be served: 
the physical scientist, the industrial or govern- 
ment data processor or decision maker; the be- 
havioral scientist. But whatever one might say 
of these consumer needs for more and better com- 
putational aids would not, I think, catch the 
spirit of free speculation which the role assigned 
to this paper is supposed to capture. 

Instead, suppose one begins by ignoring the 
product for the moment, and asking instead: what 
is the most difficult problem the human being 
faces today? The first reply that comes to mind 
seems also to be the right one: the bargaining 
problem. 

A bargaining problem is one in which the 
decision to be made must be composed of the de- 
cisions of two or more individuals with partially 
conflicting objectives. It is illustrated in 
purchasing, voting, labor -management disputes, 
legislation, treaties, and the present luke cold 
war. Our ignorance concerning optimal methods 
of solving bargaining problems is perhaps best 
illustrated by the fact that in the present inter- 
national situations all parties are far more in- 
terested in research that will make each separate- 
ly more powerful, than they are in research that 
will make each a better bargainer. 

Suppose we try briefly to break down the 
components of the bargaining problem, and then 
ask whether an intelligent technician might or 
might not be of service. (1) Each bargainer 
must try to determine the set of alternative de- 
cisions which he can make; (2) Each bargainer 
must try to determine the set of alternative 
decisions the other parties can make; (3) Each 
bargainer must try to construct an outcome matrix 
which predicts the consequence over time of each 
combination of decisions; (4) Each bargainer must 
try to estimate the likelihood that specific 
choices will be made by the other parties. 

This list is obvious enough, and is not com- 
plete. The other items of the list are more sub- 
tle, and consideration of them can momentarily 


be postponed. If we concentrate, then, on the 
obvious, it is not difficult to see that in real 
bargaining situations there is at least a potent 
tial use for the kind of computational mind 
which has been discussed in this panel. The al- 
ternative sets are practically never given. In- 
deed, if we think of the management-labor example, 
it's clear that year by year the parties learn 
more and more about the possibilities open to 
them. If we think in terms of new concepts, 
labor has learned the walk-out, the general 
strike, the sit-down, the fringe benefit, the 
escalator, etc. It is reasonable enough, I think, 
to ask whether a mind adept at developing new 
strategic concepts more rapidly than human chess 
players, might not be equally adept in the area 
of collective bargaining. It is also reasonable 
to suppose that in real bargaining situations the 
parties are only conscious of a very limited part 
of the outcome matrix, even if this matrix is 
confined to alternatives of which they are aware. 
In the recent steel strike, considerable atten- 
tion was paid to the inflationary spiral by the 
government and management, even though everyone 
could sense that this was only part of the des- 
cription of some of the outcomes. Indeed, it is 
safe to say that most of us have only a very vague 
idea of what the parties themselves thought the 
consequences would be. Here again each party 
might well benefit from an economic simulator 
which would start with the economic axioms the 
party was willing to admit and forecast the con- 
ditions that would follow any agreement for 
suitable time spans. Labor-management bargainr- 
ing will be with us for a long time, and the 
research need not be restricted to the period 
when the actual bargaining takes place. As for 
the fourth point, - the likehood of the other 
parties' strategies - this too fits the mentality 
of the intelligent technician whether or not the 
other parties are also making use of similar 
devices. 

All I can do here is to pose a suggestion, 
and not answer it. A number of obvious difficul- 
ties in using computers in bargaining would occur 
to the layman; it would be a matter for bargain- 
ing experts to determine whether the costs are 
properly balanced by the returns. Possibly some 
controlled experiments of bargainer with and with- 
out computer aids would be valuable in this re- 
gard. However, instead of loading this argument 
with the weight of technical details, it seems 
wiser to complete the broad picture of the sug- 
gestion by turning attention to the more subtle 
issues mentioned earlier. These are the issues 
of agreement and cooperation. We add to the list 
already given the following: (5) The parties 
may decide where they agree and disagree with 
respect to their determinations of the alter- 



natives, the outcome matrix, or the likehood of 
alternative moves. (6) The parties may deter- 
mine the alternative methods of settling their 
dispute. (7) The parties may decide where they 
agree on alternative settling methods. 

These three additions to the list might be 
described as aspects of "meta-bargaining", for 
they are concerned with the bargaining of general 
methods of settling specific bargaining issues. 

It is not clear of course that a step in this 
direction is necessarily desirable for all par- 
ties concerned. In the labor-management issues, 
the public presumably wants all parties to do all 
three of these things. It wants the "facts", 
which is its vague way of saying it wants to ob- 
tain an outcome matrix that all parties will 
agree upon. It is not misusing terminology, I 
think, to suggest that morality demands of bar- 
gainers that they proceed when necessary to meta- 
bargainingj such terminology suggests that the 
rest relevant of humanity is best served if this 
step is taken. The last step if successful may 
be the basis of "solving" bargaining problems, 
for though the parties may not agree individually 
on what ought to be done in a specific instance, 
they may agree on a method of settling their 
specific disagreements. It should be pointed out 
that in most real bargaining situations we don't 
know that this last step has failed, because we 
don't know that the other steps were really 
carried out. Specifically, we don't know that 
the parties really disagreed on the facts. 

In all this process too the potential of the 
computer seems real. Specifically, we have 
learned new concepts of settling disputes: the 
vote, the fact find board, legislative action, 
etc. Can we learn any faster, and could a comr- 
puter technician help speed the process? If we 
allow this free speculation to proceed to its 
ultimate limit, could any of this assistance be 
applied to the bargaining of nations? Would an 
intelligent technician be of any real value over 
the years to the U.N.? 

One footnote should be added. It will be 
noticed I'm sure that no where have I mentioned 
the problem of values, though this is undoubtedly 
central to the whole bargaining problem. Further, 
I did not include a pay-off matrix as one of the 
aspects of the bargaining problem. I did not, 
because I believe that emphasis on pay-off 
matrices inverts the bargaining problem unrealis- 
tically. If we begin our analysis with the pay- 
off matrix, we must ask how the parties will 
settle, given the utilities that accrue to each 
as a result of a specific set of choices? whereas 
it seems to me that we should more naturally ask 
what choices the parties would adopt if they 
realized the alternatives open to them and the 
consequences of each alternative. The way in 
which they would behave as they learned more 
about the situation is evidence of their values; 
values are research devices for predicting be- 
havior when the parties are enlightened. There- 
fore I don't think it's wise for the parties to 


try to visualize their problem as a game with 
specific payoffs, because if they really under- 
stood the payoffs they'd really understand how 
each would behave in the bargaining situation 
where the facts are agreed upon. If the original 
payoff matrix includes only the utilities of each 
bargainer under the supposition that he were the 
sole individual involved, then such a matrix con- 
tains very inadequate information concerning the 
values of the bargainers in a true bargaining 
situation. Having said this much, I should also 
remark that my seventh point is a very critical 
one. Just how critical it is to a behavioristic 
theory of values can be demonstrated by posing a 
postulate of this theory: if all the parties to 
a bargaining process cannot agree on a method of 
settling disagreements, given that they all agree 
on the factual outcome matrix, then the values of 
the interested parties are indeterminable. 
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Summary 

Automobile ride studies have progressed from 
simple linear analyses to complex nonlinear 
simulations performed by real-time computers. 

A complete study of automobile ride must include 
the road (input) waves, car suspension dynamics, 
and passenger response to the resulting ride 
motions. A Ride Simulator capable of carrying 
two passengers and subjecting them to realistic 
ride motions offers a new approach to automobile 
suspension design. The basic simulator, which 
consists of a magnetic tape input unit, an analog 
computer for determining car ride motions in 
real time, and a servo-controlled motion 
simulator, are described in detail. 

Introduction 

The study of automobile ride began shortly 
after the first automobile was built and has 
continued to the present day. Due to the complex- 
ity of the suspension system much of the work on 
ride has necessarily proceeded on the basis of 
empirical experience. In general, ride has been 
improved largely by means of specific cut and 
try modifications on prototype vehicles under the 
guidance of expert test driver opinion. Numerous 
attempts have also been made to analyze the 
suspension system mathematically but these have 
been only partially successful. It is the purpose 
of this paper to define the ride problem, to relate 
briefly the significant analytical efforts of the 
past, and to give in detail the use of modern high 
speed computing equipment in a new attempt to 
solve it. 

Definition of Ride Motions 

The automobile considered as a rigid body may 
experience any of six different types of motion. 

In the automobile these six motions are denoted 
by the linear velocities u, v, and w and the 
angular velocities p, q, and r as shown in Figure 
1 . The axis system is fixed in the car and moves 
as the car moves. Because the automobile 
possesses a plane of symmetry, the symmetric 
degrees of freedom (u, w, and q) do not couple 
with the asymmetric degrees of freedom (v, p, 
and r). Consequently, it is possible to break the 
problem of automobile dynamics into two separate 


problems: the first involving motion in the plane 
of symmetry called longitudinal motion and the 
second involving motion out of the plane of 
symmetry called lateral motion. 

The fore and aft motion in the plane of 
symmetry, denoted by u, relates to the accelera- 
tion and braking characteristics of the car and is 
called the "performance"motion. The up-and- 
down and pitching motions in the plane of sym- 
metry, denoted by w and q, are together consider- 
ed the primary "ride" motions of the car, while 
the three remaining motions (yawing, rolling, 
and sideslipping) are generally associated with 
steering control and are called the "handling" 
motions . 

Since the "ride" and "handling" motions are 
largely uncoupled it is possible to study them 
separately. Consequently a study of "ride" can 
be begun by studying only two motions, bouncing 
and pitching. 

Historical Development of Ride Analysis 

The earliest work of importance in analyzing 
ride was done in England by Rowell. ^ This work 
was done at such an early date (1922) that shock 
absorbers had not come into general use and the 
only damping in the suspension was due to friction 
in the leaf springs. Rowell's analytic work in the 
realm of undamped, free vibration was later 
extended by Guest^ and Olley^ . As is frequently 
the case, the automobile suspension system is 
such a nonlinear complex that analytic efforts to 
understand suspension performance quickly 
bogged down. It became apparent that computer 
simulation of some sort would be required to 
solve the suspension problem. The first analog 
computer used in suspension analysis was a 
mechanical differential analyzer built specifically 
for the task by Schilling and Fuchs in 1941 . ^ It 
solved the suspension problem for only a single 
degree of freedom but did permit the inclusion of 
a nonlinear characteristic for the shock absorber. 
A particularly important use of this computer 
was the determination of transient vertical 
acceleration characteristics as experienced by 
the passengers, and how these transients could be 
varied by altering the shock absorber character- 
istics . 
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A further advance in the analysis of ride was 
made by Jeska in 1953 when a four degree of 
freedom model was studied. ^ The motions 
considered were the vertical motions of the front 
and rear wheels together with the bounce and 
pitch of the body. The analysis included an actual 
road wave, measured by a photographic technique 
and a transportation lag equal to the car's wheel- 
base divided by the car's forward velocity. The 
incorporation of the transport lag was made 
necessary by the fact that a road disturbance 
which excites the front end of the car will also 
excite the rear of the car at a known later time. 
Good agreement was obtained between the analog 
simulation and experimental data obtained by 
driving a car over a known stretch of road. 

The General Ride Problem 

It can be seen that prior to the beginning of 
this study the analysis of ride had been concerned 
primarily with the suspension system. A small 
amount of work had been done on passenger 
tolerance to vibration but not in sufficient detail 
to be useful. There was little or no data on the 
types of irregularities present in various road 
surfaces and because of this lack, investigators 
were using single sine waves or step bumps 
(curbs) for input disturbances. Viewed in an 
overall sense the ride problem can be seen to be 
threefold in nature because it concerns the road, 
the car, and the passenger. Consequently, a 
complete analysis of ride must take into consider- 
ation all three quantities. 

Ride Analysis Through Simulation 

Subsequently it became apparent that high 
speed analog computers could be successfully 
employed to solve the equations of ride motion in 
greater complexity than had previously been 
attempted. Tape recorders had been developed 
satisfactorily such that if a road wave could once 
be recorded on magnetic tape, it could then be 
"played" into the analog computer as an input 
disturbance to the suspension system. By using 
two pickup heads the road wave could be deter- 
mined at two points analogous to the front and 
rear wheels of the car. There only remained the 
problem of inserting the passenger into the 
analysis. It seemed reasonable that the best way 
to include the passenger in the analysis was to 
put him in physically. This has been accomplish- 
ed by constructing a servo-controlled motion 
simulator which can seat two passengers and 
which follows the motions determined by the 
analog computer. An overall schematic view of 
this system, which is called a Ride Simulator, 
is shown in Figure 2. It is composed of 


(1) A tape recorder to provide road wave 
inputs into the system. The signal on 
the tape is picked off at two points 
simulating the front and rear wheels 
of the car. The tape speed is variable 
and by speeding up the tape, the car 
being simulated is also "speeded up". 

(2) An analog computer which takes in the 
signals from the tape recorder and, 
from these inputs to the car's suspension, 
determines what car body motions would 
result. The analog computer runs in 
"real time", which means that the ride 
responses that are generated by the 
computer occur at the same rate as 
would the equivalent ride responses of 

a real car on a real road. 

(3) A motion simulator to reproduce the car 
body motions determined by the computer. 
The simulator is comprised of a portion 
of a standard car body and is driven by 
electro-hydraulic servo-mechanisms 
whose inputs are output voltages from 

the analog computer. Two passengers 
may be accommodated in the motion 
simulator and the driver may control 
the speed of the car by pressing the 
brake or accelerator which sends a 
signal to a small computer which 
determines the resulting car speed 
and which, in turn, sends a speed 
signal to the tape recorder. 

Since the Ride Simulator is comprised of three 
rather complex components, the following 
sections of this paper will be devoted to a detailed 
description of each part. 

Tape Recorder 

Determination of Road Profiles - As previously 
noted, a source of real difficulty in simulating 
automobile suspensions has been the lack of 
detailed data on road profiles . Numerous efforts 
have been made in the past to determine road 
profiles, among them are direct surveying 
methods and specialized surveying methods using 
a light beam as a space reference. In general , 
the shortcomings of these methods are that they 
are tedious and that they are not adaptable to long 
stretches of road -- say ten or twenty miles. 
General Motors is currently exploring another 
possibility -- that of obtaining the vertical 
accelerations induced in a towed wheel as it is 
rolled along the road and then integrating the 
acceleration twice to obtain displacement. The 
wheel used in this work, shown in Figure 3 , iis 
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essentially a bicycle wheel fitted with a special 
tire. Also shown in this figure is a standard 
fifth wheel which is used to generate a speed 
signal. The wheel is preloaded with a low rate 
spring to maintain contact with the ground. The 
wheel-spring combination acts as a low-pass 
filter with a break frequency of about 60 cps . 

The accelerometer output is passed through a 
high-pass filter that has a slope of 18 db per 
octave and a break frequency of one-half radian 
per second. This filter is composed of operation- 
al amplifiers, as shown in Figure 4, and the 
double integration is performed within the loop. 

In Figure 4, S 2 X represents the acceleration in- 
put and Y is the displacement output. Although 
there remains considerable work to be done 
before this system is completely satisfactory, a 
number of successful short time runs have been 
made. An oscillograph record of one of these 
runs is shown in Figure 5. The road profile used 
in this test was a home-made triangular bump. 

The acceleration resulting when the wheel rolled 
over the bump is shown in the center of Figure 5 
and the displacement, recovered successfully by 
a double integration, is shown at the top. The 
measure of the wheel suspension displacement 
is used primarily as an event marker on the 
oscillograph record. 

In addition to recording particular sections 
of various road surfaces, it is also possible to 
characterize the road surface statistically. 

Then by use of electronic noise generators and 
appropriate filters, it is possible to determine 
ride responses for a known statistical input with- 
out recourse to recording of actual roads. 
Although there is very little data available on 
highways, the statistical approach, employing 
power spectral densities, has been satisfactorily 
used to determine ground loads induced in air- 
craft when taxiing over rough runways.® 

Time Delay Generation - Once the road waves 
have been recorded on magnetic tape, there 
remains the problem of providing the inputs to the 
front and rear suspension. For cars with equal 
front and rear treads, the road input functions 
for the wheels on each side of the car are identi- 
cal but displaced with respect to each other by 
the length of the wheelbase. Previously a number 
of special purpose road function generators had 
been built in order to overcome this obstacle. In 
the case of the Ride Simulator, a general purpose 
magnetic tape unit was utilized to provide the 
delay in addition to acting as the input road wave 
generator. ^ This unit, shown in Figure 6, 
utilizes a modified commercial tape transport 
with two magnetic heads. The lower head is used 
for recording purposes, and both heads are used 
for reproducing purposes. When the signal is 


picked off by both the lower and the upper heads 
there is a delay in the signal of T = d/V seconds, 
where d is the distance between the heads and Y 
is the tape speed. The tape speed is directly 
analogous to the car's speed and, similarly, the 
spacing between the magnetic heads is analogous 
to the car's wheelbase. Each head will handle 
two channels of information so that one channel 
represents the road function under the right side 
of the car, the other, the left. 

The tape recorder employs pulse-width 
modulation as a recording means . The delay 
range available is .04 to 1.5 seconds. The tape 
speed control is continuous over the range 4"/ 
sec. to 40"/sec. This corresponds to car speeds 
in the range 15 to 150 mph. The unit will accept 
signals in the frequency range of 0 to 30 cycles 
per second. Input and output impedances are 
similar to those normally associated with analog 
computer amplifiers . 

Automobile Simulation 

Seven Degree of Freedom Model - The car 
model simulated in the computer is the seven 
degree of freedom system shown in Figure 7. 

This system considers the vertical motions of all 
four wheels plus the bounce, pitch, and roll of the 
body. Although the early analysis of ride motions 
did not include roll effects, the high human sensi- 
tivity to roll displacements literally dictates that 
rolling motions be incorporated in a complete ride 
study. Only typical car dimensions and param- 
eters are given in this figure to reduce clutter. 

The simulation of this system involves summing 
the forces on each wheel and summing the forces 
and torques that act on the body. 

The general procedure is shown in Figure 8. 
The system is put into motion by the road wave 
input which enters the tire. The difference be- 
tween the road wave input and the wheel motion, 
acting through the tire characteristic, produces 
the tire force. Motion of the wheel mass results 
from the difference of the tire force and the 
suspension force . The suspension force is pro- 
duced by the difference of wheel motion and body 
motion acting through the suspension characteris- 
tic. Finally, the body motion results from the 
action of the suspension force upon it. Lagrange's 
equations were employed to derive the equations 
of motion of the ride system. These equations 
are given in detail elsewhere.® 

In order to determine numerical values of the 
parameters required in the equations of motion, 
it was necessary to measure the inertia, spring, 
and damping characteristics of the body, 
suspension and tires. For the study a typical 



288 

10.1 


1958 production car was chosen. Fortunately 
most of the required parameters could be meas- 
ured on existing test apparatus. For example, 
the pitching moment of inertia was measured by 
swinging the car as part of a compound pendulum. 
Spring rates were determined on a standard test 
rig that applied force to the suspension and 
measured the resulting deflections. Shock 
absorber characteristics were determined by a 
stroking machine which gave shock absorber 
force as a function of input velocity. Tire 
damping was measured by applying a sharp pulse 
of force to the tire and determining the time the 
oscillation required to damp out. In all, there 
are 37 car parameters used in the simulation.® 

Analog Computer - With the equations of 
motion and the car parameters in hand, the car 
ride response was determined on an analog 
computer. The computer solves the equations of 
motion by summing the various forces acting on 
each mass or inertia. Figure 9 shows this pro- 
cedure for one end of the car whose vertical 
motion has been denoted by the coordinate X. 

The forces acting on the mass of this end of the 
car are summed at the variable gain amplifier 
which is used as an integrator. The acceleration 
of this mass is equal to the sum of the input 
forces divided by the mass under consideration. 
The variable gain amplifier of Figure 9 performs 
the three functions of summing the forces, 
dividing by the mass, and integrating the result- 
ing acceleration into velocity. In Figure 9 this 
velocity is denoted as the output (dX/dt) of the 
variable gain integrator. The velocity is then 
integrated in a second fixed-gain integrator to 
produce the displacement -X. The displacement 
signal is then inverted to +X by the inverter. In 
the computer there are seven circuits like the one 
shown in Figure 9 together with nonlinear function 
generators to represent the springs, shock 
absorbers and tires. The outputs of the computer 
representing the motion of the front, rear, and 
roll of the car are fed to the motion simulator. 

It should be noted that not only the displacement 
signal X but also the velocity signal dX/dt is fed 
to the motion simulator. The reason for this will 
be made clear in a later section. 

The analog computer used to solve the equations 
is a general purpose unit, no special equipment 
being required. In all, 82 operational amplifiers 
are required to solve the problem and provide 
coordinate transformations, and as many as 
sixteen function generators may be used depend- 
ing on how many nonlinearities are considered. 

Comparison Between Analog and Digital 
Solutions - An analog computer solution of the 
linear seven degree of freedom system is given 


in Figure 10. The corresponding digital check 
solution is indicated by dots. The input used was 
a step bump or curb on the left side of the car 
only. The three traces show the roll response 
{ ) > the front end vertical motion (Z^), and the 

rear end vertical motion (Z r ). The car param- 
eters are those of a typical production car 
traveling at 45 miles per hour. This simulation 
was carried out in real time, and the agreement 
between the analog and digital computers is quite 
good. 

Motion Simulator 

A view of the motion simulator together with the 
control console which controls it is shown in 
Figure 11. The body which can carry two passen- 
gers is shown without hood and fenders. The 
weight of the body and frame are supported by 
hydraulic pressure in the central cylinder. The 
hydraulic pressure is supplied by an accumulator 
which also acts as a very low rate spring. 

Vertical motions at the front and rear of the car 
are provided by valve- controlled hydraulic 
cylinders. These servo cylinders exert vertical 
forces on movable pads which are guided on rails. 
These pads are connected to the ends of the car 
frame and, consequently, when the pad moves 
the car frame moves also. The servo cylinders 
are located approximately at the centers of 
percussion in order to prevent the front servo 
from loading the rear and vice versa. Roll 
actuation is provided by a rotary actuator mounted 
on the rear movable pad. It is carried up and 
down in pitch and bounce and as a result there is 
no appreciable coupling between pitch, bounce and 
roll. It is worth emphasizing that the central 
cylinder supports the entire weight of the car body 
and passengers, and that the servos are used only 
to provide dynamic forces to produce motion. The 
valves which control the flow of oil to the servo- 
cylinders are actuated by voltages supplied by the 
computer. A block diagram of one of the servo 
systems is shown in Figure 12. The input to the 
servo is the voltage E : and the output is X. The 
servo loop is completed by a voltage Ep supplied 
by movement of the wiper on the feedback 
potentiometer. The transfer function of this 
simplified system, which neglects oil spring 
effects and numerous nonlinearities, is x/Ej , 
which is the inches of simulator movement per 
volt input to the servo. It can be seen that the 
transfer function can be written as 

X = G = Inches of Displacement 

E! 1 + Tj S Volt Input 

Thus it may be seen that the system will not 
give good reproduction of output frequencies much 
higher than l/Tj radians per second. Fortunately 



289 

10.1 


the analog computer provides a way around this 
frequency limitation. For example, if the system 
were perfect, the system response to a voltage 
E c from the computer would be 

X = G 



If the simulator is to track a voltage E c it can be 
shown that the servo input Ej should be made 
equal to E c + T 1 SE C . To prove this set 



But since x/Ei = G/(l + TjS), it is clear that 
Ej = E c (1 + TjS) = E c + T x ^ d E c j 

Thus it may be seen that if the motion simulator 
is accurately represented by the simple transfer 
function G/(l + Ti S) that the simulator may be 
made to track a voltage E c from the computer 
provided that the input voltage to the servo is the 
tracked voltage E c + T x times its time derivative. 
Fortunately this derivative already exists in the 
computer and it is a simple matter to multiply it 
by Ti and feed it to the servo. This action 
effectively cancels an unwanted pole in the trans- 
fer function. The effect of this cancellation com- 
pensation is shown in the frequency response 
curves of Figure 13. It may be seen that the 
normal servo system has a bandwidth of about 
4 cycles per second (the bandwidth is the maxi- 
mum frequency at which the amplitude ratio is 
within + 3 db of the amplitude ratio at zero 
frequency). The effect of cancellation compensa- 
tion is to extend the bandwidth to 7 cycles per 
second. It is also clear that the transfer function 
of the system is more involved than the simple 
formula shown previously. The main perform- 
ance figures of the Ride Simulator are given in 
Table I. 



Pitch 

Roll 

Maximum Acceleration, Deg. /sec 2 

1,150 

500 

Maximum Velocity , Deg. /sec. 

48 

240 

Maximum Displacement, Degrees 

+ 6 

+ 10 

Bandwidth, cps 

5 

4 

Positional Accuracy, Degrees 

.025 

.10 


TABLE I- A: Ride Simulator 

Rotational Performance 


Bounce 


Maximum Acceleration, In. /sec 2 1,250 

Maximum Velocity, In. /sec. 40 

Maximum Displacement, Inches + 5 

Bandwidth, cps 5 

Positional Accuracy, Inches . 020 


TABLE I-B: Ride Simulator 

Translational Performance 

The bandwidth figures given represent the 
performance of the servos when cancellation 
compensation is used. 

The simulator performance requirements 
originated in road test data which determined 
extremes in each of the variables. The road data 
was determined by driving an instrumented test 
car over a number of different bumps at various 
speeds and determining the peaks of the accelera- 
tion, velocity, and displacement transients . From 
a collection of these curves, an extreme for each 
variable was obtained. A comparison of the 
observed maximum of each car motion variable 
with the maximum of the corresponding Ride 
Simulator motion variable is shown in Table II. 



Bounce 

Pitch 

Roll 

Maximum Acceleration 

540% 

640% 

125% 

Maximum Velocity 

130% 

340% 

850% 

Maximum Displacement 

83% 

120% 

110% 


TABLE II: Simulator Performance Capability 
Relative to Maximum Car Motions 

It may be seen that the Simulator has the 
capability of reproducing the extremes of real car 
motion with the exception of bounce displacement. 
This is not considered a major drawback since a 
five inch bump, which the Simulator can accom- 
modate, represents a violent disturbance to the 
passengers at almost any road speed. 

In addition to providing a means of extending 
the apparent servo bandwidth, the computer 
calculates the coordinate transformation required 
when the center of gravity of the simulated car 
does not correspond with the fixed pivot of the 
motion simulator. By means of another coordin- 
ate transformation it is possible to provide the 
sensations of rear seat ride to passengers 
occupying the front seats of the motion simulator. 
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Comparison of Overall Response 
to Real Car Response 


References 


The real point of a ride simulator is that the 
simulator will produce to a high degree of 
fidelity the motions experienced by a real car on 
a real road. To prove out the complete system a 
typical production car whose parameters had been 
completely determined was instrumented with an 
integrating accelerometer and then was driven 
over a triangular bump at 20 miles per hour. 

The triangular bump used in this work was 60 
inches long and 4.8 inches high. Then the Ride 
Simulator was put through the same procedure, 
namely: 

(a) An identical triangular bump was 
recorded on magnetic tape and played 
into the computer by the Simulag. 

(b) The computer determined what the 
car's responses should be and sent 
motion signals to the motion simulator. 

(c) The simulator moved representing the 
motion of the car over the bump. 

To demonstrate the fidelity of the overall 
simulator, the integrating accelerometer was 
removed from the car and was installed in the 
same location in the motion simulator body. The 
results of this test which compare the response 
of a typical 1958 production car with a simulated 
version of the same car are shown in Figure 14. 

A comparison of accelerations is shown at the top 
of this figure and a comparison of displacements 
at the bottom. The agreement is generally good. 
The oscillations in the acceleration trace for the 
production car are due to a stiffer rear floor pan 
where the accelerometer was mounted. 

Conclusion 
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The Ride Simulator has demonstrated its ability 
to reproduce the on-the-road ride response of 
real cars . The way is now open to evaluate new 
suspension systems without recourse to expensive 
component constructions and to study the passen- 
ger and seat combination under closely controlled 
conditions . The Ride Simulator is an important 
step forward in ride analysis because it will 
enable improvements in ride to be made with a 
minimum of time and effort. 
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Figure 2 

SCHEMATIC VIEW OF THE RIDE SIMULATOR 
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Figure 3 

INSTRUMENTED WHEEL USED TO MEASURE ROAD WAVES 
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Figure 5> 

TRIANGULAR ROAD WAVE DETERMINED BI INSTRUMENTED WHEEL 
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Figure 6 

TAPE RECORDER USED FOR ROAD WAVE GENERATION AND TIME DELAY 
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Figure 9 

INTEGRATION OF ACCELERATION TO DISPLACEMENT 



Figure 10 

COMPARISON OF ANALOG AND DIGITAL SOLUTIONS 
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OVERALL VIEW OF RIDE SIMULATOR SHOWING COMHJTER, MOTION SIMULATOR, 
AND CONTROL CONSOLE 
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Figure 1M 

ACCELERATION COMPARISON OF REAL AND SIMULATED CARS 
PASSING OVER TRIANGULAR BUMP 



Figure 1UB 

DISPLACEMENT COMPARISON OF REAL AND SIMULATED CARS 
PASSING OVER TRIANGULAR BUMP 
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Summary 

An analog simulator with specially designed 
electromechanical equipment was developed to 
simulate one of three loops of the Enrico Fermi 
Atomic Power Plant, including all elements except 
the steam turbine; i.e., the reactor, intermediate 
heat exchanger, steam generator, mixing and 
transport delay phenomena, and steam system 
hydraulics. 

A philosophy of plant control was evolved on 
the basis of plant operating characteristics, as 
determined by the simulator and supported by a 
steady-state digital analysis of the plant. Concep- 
tual control hardware designs were then tested on 
the simulator and an optimum design was selected 
and tuned. Actual hardware of the automatic con- 
trol system was simulated, and this simulation 
circuitry incorporated in the computer. 

Control consoles exactly duplicating the con- 
trol panel layout in the actual plant were connected 
to the analog simulator to adapt it for power plant 
operator training. Trainees thus get the "feel" of 
plant operating characteristics, learn to meet 
emergencies, and practice standard operating 
procedures, using both manual and automatic 
controls. 

The facility has already proved valuable in 
studying normal and abnormal operating condi- 
tions and remains available for any future systems 
analysis needs; meanwhile it is in continuing 
service for operator training. 

Introduction 

The Enrico Fermi Atomic Power Plant on the 
shore of Lake Erie near Monroe, Michigan, has a 


sodium- cooled fast breeder reactor as its heat 
source. Designed by Atomic Power Development 
Associates, Inc., it is being constructed under the 
AEC Power Demonstration Reactor Program, but 
is to be owned and operated by the Power Reactor 
Development Company, a non-profit organization 
of 25 industrial and utilities firms. 

In the thermal portion of the plant, flowing 
liquid sodium will carry heat from the reactor to 
once-through steam generators where super- 
heated steam will be produced. This steam will be 
delivered to a turbine -generator built and operated 
by the Detroit Edison Co. Normal power output 
will be 100 MW (electrical) with steam at 740°F 
and 600 psia. 

Some three years ago a contract was entered 
into by Power Reactor Development Co. and Holley 
Carburetor Co. for the study and simulation of an 
operating power control system for the Fermi 
plant. Atomic Power Development Associates, Inc. 
(APDA) was to serve as technical liaison agent for 
Power Reactor Development Co. (PRDC). Objec- 
tives of the program were: 

1. To simulate plant performance over a wide 
power range. 

2. To develop a recommended concept for an 
operating control system. 

3. To convert the simulator into a training 
device for power plant operators. 

Because the variables which must be con- 
trolled to regulate power production from such a 
plant are intimately related, with a change at one 
point causing changes to occur at other points, 
development of a sensitive and accurate control 
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system was necessary to establish safe and stable 
power plant operation. Uniqueness of certain 
elements of the plant further complicated control 
planning, and many of the details of control system 
evolution recorded herein represented pioneering 
research efforts. 

Early planning in the first phase of the pro- 
ject called for parallel digital and analog computer 
studies to provide a dual approach to plant simu- 
lation and determination of adequate control 
system requirements. Analog simulation proved 
workable for the entire problem, but digital 
studies ran into complications of excessive com- 
puting time for certain aspects of plant operation 


and therefore could not be relied on for a complete 
transient solution. The digital work, however, was 
useful in supporting and verifying analog results. 

Power Plant Description 

Figure 1 presents a block diagram of the 
entire plant. In the power section are the reactor 
and three parallel heat transmission circuits, 
delivering reactor thermal energy in the form of 
superheated steam at 600 psia and 740°F. Each 
circuit begins with a primary loop of liquid sodium, 
circulating through the reactor, absorbing and 
carrying away heat. Since the primary sodium 
coolant is strongly radioactive, it is not pumped 



Figure 1 . Block Diagram of Enrico Fermi Atomic Power Plant Showing All Three Loops 








directly to the steam producing unit, but instead 
passes through an intermediate heat exchanger 
where heat is transferred to a secondary, non- 
radioactive sodium loop. The heated secondary 
sodium then flows to a steam generator where 
feedwater, entering at 600 psia and 340°F, is 
heated, boiled and emerges as superheated steam. 
The once-through steam generators of the cross- 
flow configuration represent a fairly new concept 
in steam power design. 

As shown in Figure 1, the combined output of 
the three steam generators is delivered to a 
single turbine driving the electric generator. Use 
of the three steam generators and three related 
heat supply circuits from the reactor introduces 
a substantial margin of plant safety, since simul- 
taneous failures in all three circuits are extremely 
unlikely. Also included in the steam system are 
steam and feedwater piping and valving, condenser, 
deaerator, feedwater pumps, pump drives and 
emergency water supply. 

In the early months of the study the physical 
details of plant layout had not been determined, nor 
were details available on the actual intermediate 
heat exchangers and steam generators. However, 
conceptual designs and data on required perfor- 
mance of components were available based upon 
an ultimate reactor thermal power of 430 MW. 
The control system was to be designed for initial 
plant operation at 300 MW (referred to as 100 
percent power) but was to be adaptable to higher 
power levels. 

Purpose of the Operating Control System 

This study is specifically concerned with the 
conceptual design of an operating control system 
for the power plant. "Operating" is here distin- 
guished from the power limiting or safety system, 
which has been developed separately to limit large 
power excursions and to act as a safety-shutdown 
device in the event of abnormal operation. 

The purpose of the operating control system 
is the maintenance of power plant operating 
stability, while regulating power in response to 
external demand. In the reactor this connotes 
controlled variations in power output to meet 
changing power demands, and prompt leveling out 
at each new power level, with any tendency to 
"hunt" damped out to the greatest extent possible. 
In the sodium loops temperature rises and drops 
correspond to power output. Related temperature 
changes must be controlled to maintain proper 
phase relationship during power changes to avoid 
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plant instabilities which involve surging of tem- 
perature and power output. To the water-steam 
system, stable plant control calls for regulation of 
pressure and flow to match power output, while 
avoiding pressure or flow surges that would cause 
corresponding fluctuation in steam generator 
performance and thus reflect back into the sodium 
system. 

Initially a control philosophy had to be derived, 
determining which of several variables should be 
selected as controlling, which others they could be 
made to control, and which could remain passive. 
Because the Enrico Fermi Power Plant is to a 
great extent unique, many factors loom large in 
the control problem that have hitherto been 
negligible or nonexistent in prior steam and 
nuclear power development. The complexity of its 
heat cycle, from the standpoint of system dyna- 
mics, its small temperature coefficients, long 
transport times, and the peculiarities of once- 
through-type steam generators: these are factors 
that governed the ultimate choice of control 
philosophy. Once the philosophy had been estab- 
lished, a control system was to be worked out, 
and the required operating characteristics of the 
control components were to be defined. 

Preliminary Planning for Plant Simulation 

Early plans called for simulation of all three 
circuits of the plant, at all power levels from 
source to 150 percent of rated power. Since the 
complexity of the problem would have made this 
approach prohibitive in terms of computer cost 
and maintenance time, the objectives were re- 
laxed wherever possible without seriously com- 
promising the results. First, the simulation was 
restricted to only one of the three heat trans- 
mission circuits, saving nearly two-thirds in 
simulator costs. The technique is valid since all 
circuits are identical in operation, and a 3-to-l 
compensation is easily inserted wherever the 
three circuits come together, as in the reactor. 

Further, because low-power-level simulation 
involves voltages approaching the error band of 
the computer, the stability study was limited to 
higher levels where error effects would be negli- 
gible. With the computer used in the early stages 
of the* study, the minimum level for quantitatively 
accurate simulation was 25 percent of rated plant 
power. 

In addition, because of the advanced state of 
steam power technology, it was felt that the 
water- steam system external to the steam gen- 
erators need not be simulated, as any significant 
transient behavior could be predicted when neces- 
sary on the basis of existing knowledge. 
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Components Simulated Individually 

Within these limitations, it was decided to 
simulate the plant as thoroughly as possible, 
representing all significant components and ef- 
fects individually, avoiding the'Tumping” approach 
of the transfer function technique, where the 
identity of an individual component or phenomenon 
is lost. The individual component approach was 
required because nonlinearities of the heat system 
limited the transfer function approach to approxi- 
mately 5 percent of the power range, while 
simulation was desired over a much wider power 
range. The characteristics of each major com- 
ponent of the plant thus were established by the 
functional interrelationship of its simulated parts 
rather than by mathematical approximation of 
the whole. 

In this way, a clear physical correlation of 
components in actual plant and simulator could be 
maintained, for improved operator comprehension, 
quick location of simulator faults, and ready 
adaptation to changes made in the course of plant 
development. Although it might have been designed 
to operate much faster than the physical plant, 
the simulator was built to operate in real time, 
because of its eventual use as an operator training 
device. 

Identification of significant components and 
effects warranting simulation, and the determina- 
tion of the manner and extent to which they should 
be represented, required considerable prelim- 
inary study. Extensive digital computer analysis 
of the dynamic equations describing individual 
component operation was necessary to devise 
workable analogs for settingup the simulator. The 
digital studies also revealed various undesirable 
steady-state plant operating conditions which the 
control system must be designed to preclude. 

Phase I Simulation 

Following the aforementioned preliminary 
studies, the first plant simulator was assembled 
by Holley Carburetor Company during the early 
months of 1958, using a commercially available 
general-purpose d-c analog computer, together 
with specially designed and built supplementary 
electronic equipment which simulated effects 
impractical to achieve with standard computer 
circuitry. Continuously variable voltages, adjust- 
able and fixed resistors, and capacitors in the 
simulator represent and are analogous to the 
physical variables and parameters of the plant. 


The Phase I simulator represented only one 
circuit of the power-producing section of the plant 
--one reactor, one heat exchanger and one steam 
generator, plus their connecting sodium piping 
and pumps. Simulations of the individual pheno- 
mena and components described in the following 
paragraphs were interconnected in their proper 
sequence, following precisely the actual functional 
relationships of the plant components. Though 
water-steam dynamics were omitted, a simplified 
representation of the feedwater pumps permitted 
a rudimentary simulation of feedwater flow that 
could be changed either manually or automatically. 

Transport Delay 

When a system experiences a transient input 
signal, the system output has some time relation- 
ship to the input. This time delay or phase 
relationship of output to input is an important 
factor in adapting a control to the system. When 
the system includes long piping such as is used in 
the Fermi plant, the piping coolant time (transport 
delay) between and through components is a 
significant element in determining the over -all 
time response of the plant, and simulation of this 
delay is vital to over -all plant simulation. 

Based on the physical analogy of a bucket 
brigade, an electromechanical system was devised 
to simulate in a practical and economical fashion 
the transport time of sodium flow through piping, 
intermediate heat exchanger, and steam generator. 
Two delay wheels, one for the primary loop and 
one for the secondary loop (Figure 2), were 
constructed using capacitors which represent 
slugs of sodium flowing through the pipe or 
element being simulated. Capacitor voltage repre- 
sents sodium temperature. With the wheel rotating, 
the time required for a capacitor on the wheel 
to travel from an inlet to an outlet point is the 
same as the time required for sodium to travel 
the length of the physical pipe. 

The delay wheel speeds may be varied to 
simulate different sodium flowrates. Although the 
basic study was limited to operation at fixed 
sodium flow rate, the variable flow rate feature 
permits consideration of cases of abnormal 
coolant flow, and will allow for higher flow rates 
if required when simulating plant operation at the 
higher power levels planned for the Fermi plant 
after the first few years of operation. 

Mixing 

Plenums at the inlet and outlet of the reactor, 
intermediate heat exchanger and steam generator 
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Figure 2. Two delay wheels based on physical analogy of 
bucket brigade were constructed by Holley personnel. 
Capacitors represent slugs of soduim flowing through pipe 
or element being simulated. 

introduce a mixing phenomenon. If the temperature 
of sodium flowing into one of these plenums 
changes, the outlet temperature will also change. 
However, because incoming sodium mixes with 
that already in the chamber, the temperature 


change at the outlet is more gradual than that at 
the inlet. The relatively sluggish response of 
mixing tanks to inlet temperature transients 
results in a phase shift at the outlet, which 
complicates the control picture, but also has the 
beneficial effect of tending to smooth out transients. 

Reactor 

Power level in the reactor is proportional to 
the neutron population. Any change in power level 
depends on changing the position of the reactor 
control rods and/or temperature within the re- 
actor. Reactor simulation provides for all of the 
interrelationships between neutron population, 
control rod position, internal temperatures, and 
heat extraction rate. The neutron kinetics are 
represented by a classical six-group simulation. 
The output of this circuit, proportional to heat 
produced, goes to separate core and blanket 
simulations. Although their simulation circuits 
are basically the same, the electrical constants 
differ to account for their different sodium flow 
and heat generation rates. Temperature coeffi- 
cients from the core, blanket and structure 
simulations combine with rod position to simulate 
reactivity, which in turn becomes the input to the 
neutron kinetics simulations. 

Intermediate Heat Exchanger 

The intermediate heat exchanger (IHX) simu- 
lated in the Phase I study is of counterflow 
design; the primary or radioactive sodium flows 
through in one direction while the nonradioactive 
sodium flows in the opposite direction. The two 
sodium flows are separated by the walls of the 
stainless steel pipes which carry the secondary 
sodium. In the steady state a uniform temperature 
differential exists from inlet to outlet and heat is 
transferred very efficiently from primary to 
secondary sodium. When a temperature transient 
occurs at either primary or secondary inlet, both 
outlet temperatures change. The close coupling 
characteristic of the IHX causes the greater 
response to a primary inlet temperature transient 
to appear in the secondary outlet while the 
greater response to a secondary inlet transient 
occurs at the primary outlet. 

The previously described delay wheels were 
used as a direct electromechanical simulation of 
the IHX, in an unusual, simplifying application 
shown in Figure 3. The wheels rotate in opposite 
directions, thus simulating the counterflow of 
sodium in the heat exchanger. As the wheels 
rotate, charge flows from the primary wheel 
capacitors to the secondary wheel capacitors via 
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1 10 OF TOTAL LENGTH = 1 SLUG 



Figure 3. Analog Simulation of Intermediate 
Heat Exchanger 

stationary electrical T -networks simulating heat 
transfer from primary to secondary sodium. This 
unique simulation is straightforward, accurate and 
compact, and eliminates a large number of opera- 
tional amplifiers that otherwise would be required. 

Steam Generator 

The once -through steam generator is the most 
sensitive element in the power section of the 
plant. Over the entire power range the sodium 
outlet temperature follows closely any changes in 
sodium inlet temperature, feedwater temperature, 
or changes in sodium or water flow rate. The 
superheated steam outlet temperature also is 
sensitive to sodium inlet temperature. 

Heat carried into the steam generator by the 
sodium is transferred through a separating metal 
wall to the H2O which enters the steam generator 
as water and leaves as steam. The changing phase 
of H2O (from water to low-quality, high-quality 
and superheated steam) introduces complications 
into the analysis and simulation of the steam 
generator because density and heat transfer 
coefficient of the H2O change from phase to phase. 


In developing a simulation, the steam genera- 
tor was divided into 14 sections. Heat flow from 
sodium to H2O in each section was represented 
by using a T-network connecting two capacitors 
(C), simulating heat capacity of sodium and H2O 
respectively, as shown in Figure 4 . A third 
capacitor between represented heat capacity of the 
separating wall, and resistors (R) in the circuit 
represented the heat transfer coefficients. The 
sodium side of this circuit was readily simulated 
by capacitors on the secondary delay wheel already 
described, but the H2O side presented a challenge 
because of the varying characteristics of the H2O. 

Rather than insert a variable resistor and 
capacitor in the circuit to simulate changing heat 
transfer characteristics, it was decided to im- 
press a fictitious voltage E (Figure 4 ) in the 
circuit, which would allow the same heat flow as 
variable elements R and C would produce. 

E had to be computed for each section in 
continuous sequence, and so it was necessary to 
connect the computation circuit with each T-net- 
work in rapid succession using a fast stepping- 
switch which performs several related functions 
at each station (section), sweeping continuously 
from steam generator inlet to outlet, in the 
opposite direction from the rotation of the delay 
wheel used to simulate the sodium side. With E 
so developed, it was possible to use fixed elements 
(a capacitor and resistor) for H2O heat capacity 
and heat transfer coefficient, and yet obtain the 
correct simulated heat flow conditions. 

The H2O profile obtained by this method of 
solution is continously displayed on an oscillo- 
scope and provides an excellent means of examin- 
ing H2O conditions during performance tests. 

Control System Development — Phasel 

After simulations of the individual compo- 
nents described above had been interconnected in 
proper sequence, the simulator was ready both for 
testing plant characteristics and for testing and 
developing control systems for the plant. 

Operating within the limits of basic design 
conditions, and taking into consideration certain 
ground rules established by APDA, it was decided 
that control of the plant on the basis of scheduling 
reactor upper plenum temperature from the de- 
mand power setting offered the best promise of 
maintaining thermal stability of the system, at the 
same time avoiding undesirable operating con- 
ditions. 
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Development of Simulation of Heat Flow 
Relationships in Steam Generator 


Figure 4. Heat Flow .Relationships in the Steam Generator and Development of Their Simulation 


With reactor rod control position and feed- 
water pumping rate as external control mediums, 
several control systems were studied using the 
simulator. The system finally adopted embodies 
rod control in a limited proportional error servo- 
mechanism with first derivative output control; 
feedwater control is a scheduling system with 
integral error bias. Following optimization of this 
control system, plant characteristics were studied 
for a variety of operating conditions, with the 
optimized control system functioning. Figure 5 
presents a block diagram of the control system 
proposed on the basis of Phase I investigation. 

Phase I Results 

In addition to considerable work in analyzing 
the steady-state and dynamic performance of the 


several plant components, the end products of this 
Phase I period were the electronic simulator of 
the plant, and the conceptual design of the operating 
control system. The Phase I objective had been to 
simulate only as much of the plant as necessary to 
include the basic elements directly related to 
power control, such as the reactor, the sodium 
loops, the IHX and the steam generator, omitting 
the section of the plant beyond the steam generator. 
However, as the study advanced, it became appar- 
ent that the counterflow, once -through steam 
generator has a much faster response than con- 
ventional boilers, and that it is in fact the most 
important and sensitive single element affecting 
control of the plant. Therefore its simulation 
should be as accurate as possible, and all factors 
affecting its performance should be simulated. 
Thus the entire water -steam hydraulics would 




POWER SET 



Figure 5. Schematic Diagram of Proposed Control System, Phase I 

















have to be introduced because of the effect of H 2 O 
pressure and flow rate on the heat transfer 
characteristics of the steam generator. Further, 
it had become increasingly desirable to investigate 
operation at lower power levels than possible with 
the simulator available at the termination of Phase 
I. This was primarily due to a new start-up concept, 
which involved flooded steam generators. Hence 
an extension and improvement of the control 
system study was clearly indicated. 

Phase II Simulation and Control Studies 

Phase II of the control system study was 
undertaken to expand the simulator’s accurate 
operating range by using an improved basic com- 
puter, and also to add an accurate simulation of 
water -steam dynamics in the power -conversion 
section of the plant. Other changes in the specially 
designed supplementary components were intro- 
duced to make improvements dictated by the 
Phase I study. Also the new simulator (Figure 6) 
was to be developed to serve for training of plant 
personnel as well as for continuing stability 
analysis. Since operation is the same in either 
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case with a simulator set up to operate in real 
time, the training function could be served simply 
by adding an appropriate training console to the 
basic simulator. 

Further objectives of the Phase II study were 
simulations of scrams (rapid shutdowns), nuclear 
start-up, and power start-up with flooded steam 
generator; a more detailed study of rod control 
components; and development of feedwater control, 
represented simply in Phase I, into a complete 
control subsystem. 

Simulator Scaling Changes and Other 
Improvements 

Accuracy of the Phase II simulator is greatly 
improved over Phase I by the use of higher 
quality components and a broader voltage range 
which permits finer scaling. A scaling ratio of 
5°F per volt makes the Phase II simulator almost 
2 1/2 times as precise as the older one. Low- 
power-level operation can now be carried on down 
to 10 percent of rated plant power, well below the 
level where steam generation will begin in actual 



Figure 6. Completed Phase II Simulator with Main Problem Board in Place. 
Instructor Console at Far End of Room Next to Recording Equipment (right) 
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plant operation. The upper limit has also been 
extended by the scaling ratio revision, to as high 
as 150 percent power operation. 

All of the simulated components in the sodium 
loop benefit from the refined scaling of the 
simulator. Minor revisions have been made in 
scaled values throughout the simulator on the 
basis of data revised by APDA, and in the reactor 
simulation a circuit has been added to simulate 
scrams. 

While the Phase I study was in progress, 
Power Reactor Development Company decided to 
purchase crossflow, counterflow, once-through 
steam generators instead of the counterflow type 
originally planned, but the simulation remained 
basically the same as in Phase I, with the differ- 
ence in type being accounted for by introducing an 
equivalency factor deter mined from digital studies. 

Mechanical improvement in the stepping 
switch, improved electronic components, revised 
scaling ratio and a different method of calculating 
heat transfer coefficients all contribute to greater 
accuracy of the steam generator simulation. 
However, the major change in Phase II comes from 
incorporating in the steam generator simulation 
the effects of variations in pressure and flow rate, 
respectively, on the relationship of H 2 O tempera- 
ture and heat transfer coefficient with enthalpy, 
which were not considered in the Phase I study. 
These pressure and flow inputs to the steam gen- 
erator simulation are received from the steam- 
water hydraulics simulation. 

Steam System Hydraulics Simulation 

A complex representation of the hydraulics 
and pressure dynamics for feedwater and steam 
and their related components was undertaken in 
Phase II. The interplay of valves in the steam 
system required consideration of short response 
times of the order of 2 seconds for the turbine 
throttle valve and steam bypass (dump) valve. 

One steam loop as shown schematically in 
Figure 7, with associated piping and valving, is 
simulated on the assumption that all three loops 
operate identically. Included in the simulation are 
the feedwater pump, feedwater control valve, 
steam dump (bypass), turbine throttle valve, and 
auxiliary line for start-up; plus piping friction 
losses and flow inertia on both water and steam 
sides, and steam line storage due to compressi- 
bility on the steam side. Also included are the 
extremely important storage effects in the steam 


generator, in response to changes in flow and 
pressure. The turbine, condenser and condensate 
pump are not included. 

Considerable pioneering was necessary in 
devising the simulation of the foregoing elements. 
Prior to simulating steam system components, 
extensive analytical studies were necessary to 
evaluate all variables involved and determine the 
best approach to their simulation. 

Analysis of the steam system was worked out 
by electrical analogy (Figure 7). using resistors, 
capacitors, and inductors to represent frictional 
pressure losses, steam compressibility and fluid 
inertia, respectively. These analogous electrical 
elements were then simulated directly without 
going through the differential equation step, a 
relatively new technique explained by Larrowe,! 
that saves much time and effort initially and 
simplifies the task of making changes. 

Water storage effects in the steam generator, 
which account for a predominant part of the tran- 
sient response of the entire steam system, were 
a major aspect of the simulation. These storage 
phenomena cause steam generator output flow rate 
to lag input flow rate about 15 seconds, which 
significantly affects the transient characteristics 
of the system. 

Control Studies — Phase II 

On the basis of the conceptual design devel- 
oped by Holley in Phase I of this study, the control 
system supplier designed automatic control sys- 
tem hardware. These control elements were 
simulated in a more complete form than in Phase 
I, and stability of the system was investigated with 
these controls in operation, this time taking into 
account the important effects of steam-water 
hydraulics which had been omitted from the Phase 
I simulation. Results of these studies engendered 
further refinements in control elements and helped 
to determine optimum control settings. 

As mentioned earlier, the control system for 
the Fermi plant comprises two divisions: rod con- 
trol which regulates reactor output in response to 
power demand; and feedwater control which regu- 
lates water flow rate to match power output and to 
maintain a scheduled sodium temperature out of 
the steam generator. 

Rod Control. Early in Phase II considerable 
simulation work was done in finalizing the rod 
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FEEDWATER 


PUMPS 

SCHEMATIC DIAGRAM OF STEAM SYSTEM 



ELECTRICAL CIRCUIT ANALOGY FOR 
STEAM-WATER HYDRAULICS 
Figure 7. One Steam Loop Shown Schematically (above) and the 
Electrical Circuit Analogy Upon Which Simulation Was Based 
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control system. A large number of computer 
runs^* were programmed to illustrate over-all 
performance of the system in response to loss of 
signal and disturbances in reactivity, reactor inlet 
temperature, reactor coolant flow, and power de- 
mand set point. 

Feedwater Control. The function of the feed- 
water control is to regulate feedwater flow rate 
according to the requirements of reactor power 
and sodium system, while maintaining steam pres- 
sure constant. This control function is performed 
by manipulation of the feedwater control valve for 
flow control and the turbine throttle valve and dump 
valve for pressure control. 

The three main divisions of the feedwater 
control subsystem are an automatic control for 
the feedwater control valve, a pressure controller 
for dump valve and turbine throttle valve, and a 
pump speed control. 

Conversion to Operator Training Facility 

Following the basic systems analysis work, 
a control console (Figure 8) was built, duplicating 
the control panels at the Fermi plant, and 
connected to the completed analog simulator to 
convert the entire installation to a training 
facility for power plant operators. An important 
factor making it feasible to use this simulator for 
operator training is that all components of the 
plant were simulated individually, and therefore it 
is possible to instrument the training console for 
indications of input and output temperatures of the 
* Details of these performance tests by Holley 
have been published by others. See Reference 2. 


various components, or flow rate in different 
locations in the steam system, just as would be 
done in the operating control room of the actual 
plant. As already noted, the simulator was built 
to operate in real time to facilitate conversion to 
training purposes. 

In the training program now in progress, the 
simulated plant can be operated with either manual 
or automatic control, starting from a low level of 
power (30 MW) and carrying on into very high 
power ranges (450 MW). Both load and unload type 
operation, normal shut-downs, and abrupt shut- 
downs can be tried with either automatic or 
manual control. The console thus gives operators 
the feel of plant operation and allows them to 
develop and practice operating procedures. 

The training console, built in the exact size 
and configuration of the plant control console, 
shows all instruments and controls just as they 
will appear in the plant control room. Thus, in 
addition to items required for operation of the 
single loop simulation, corresponding items for 
the other two loops have been dummied on the 
panel. The operator becomes oriented in relation 
to appearance and location of all instruments, 
lights, control levers, etc. This familiarity is 
vital since he will be isolated from normal plant 
sounds and these indicators are his only avenues 
to awareness of plant operation. 

Nuclear Start-Up Training 
and Trouble Simulation 

Since the fuel cycle of the Enrico Fermi 
Atomic Power Plant necessitates frequent reactor 



figure 8. Control Console for the Power Producing Section of the Fermi Plant 
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start-up from source level, considerable practice 
in nuclear start-up and its various ramifications 
is absolutely necessary for plant operators. Using 
the simulator as a general purpose computer, a 
new problem board was set up to simulate the 
physical behavior of the reactor alone from 
source level — 1 watt power — through six decades 
up to 1 MW. 

The simulation was based on standard equa- 
tions describing the reactor on a one-point basis. 
Following the method outlined by Franz and 
Simcic, 3 the equations were transformed to 
expressions containing the logarithm of N (neutron 
population) rather than N, because the range of 
values for N could not be simulated within the 
usable voltage range of the computer. 

A trouble simulation board has also been 
developed, whereby an instructor can deliberately 
introduce plant malfunctions into the operating 
problem without knowledge of the trainees. 
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Simulator Capability 

The complete one-loop simulation of the plant 
is now available as a systems analysis tool for 
any further studies required, and serves at the 
same time as a training device which familiarizes 
plant operators with both manual and automatic 
operation. Both systems analysis console and 
operator training console are connected at all 
times, and the type of operation desired may be 
selected at will. 

The simulator is accurate to 1 percent, and 
with the main power problem board in place 
operating conditions over a range of 30 to 450 
MW power can be simulated. With the nuclear 
start-up patchboard installed, operation from 1 
watt to 1 MW can be simulated. 

The entire program has thus been able to 
meet the three basic requirements set forth at the 
beginning of the project. The facility in its present 
form will be transferred to the reactor site on 
or about May 15, 1960, and will become a vital 
part of the on-site training facilities. 


3. J.P. Franz and N.F. Simcic, "Nuclear Reactor 
Start-Up Simulation," I.R.E. Transactions on 
Nuclear Science , Mar. 1957, pp. 11-14. 

For complete details of this project write 
for a copy of Report 59-12 with addenda to: 

Robert Kley or Samuel Irwin 
Holley Carburetor Co. 

11955 E. Nine Mile Road 
Warren, Michigan 
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"ANATRAN" — FIRST STEP IN BREEDING THE "DIGINALOG** 

L. A. Ohlinger 

Chief of Computing and Datamation Center 
Norair Division, Northrop Corporation 
Hawthorne, California 


Summary 

A planned and controlled integration of dig- 
ital and analog computers could be effected by 
industry if the final general purpose "diginalog" 
were produced in steps commensurate with the devel- 
oping state of the art. The first step, described 
herein, is the development of an analog symbolic 
language and digital compiler such that the engi- 
neer can write his problems in quasi -mathematical 
and familiar language which the digital computer 
can then translate into a digital input function 
generation, a set of .calculated pot settings, and 
a complete analog flow diagram. Subsequent steps 
in developing the diginalog are also described. 

"The Hiatus Closed" - A 3 Act Play 
The Plot 

Kipling could well have been speaking of dig- 
ital and analog conputers when he said: never 

the twain shall meet". The gulf that has existed 
between digital and analog computing people has 
varied all the way from one of complete disdain to 
one of open antagonism between the two sects. Only 
rarely has some prophet raised his voice in suppli- 
cation for a union of these two remote members of 
the machina sapiens family. Still more rarely has 
the voice crying in the wilderness resulted in any 
actual cooperative effort or in any union of hard- 
ware between the general purpose medium or large 
scale digital machines and the general purpose an- 
alog equipment. 

Certainly, there has been little concerted 
effort toward bringing together these two powerful 
engineering tools in a planned union that will take 
advantage of the best features of each class of 
computer and that will lead ultimately to a single 
still more powerful machine - - the "Diginalog". 
True, the digital and analog machines live in two 
different worlds, they speak two different lan- 
guages, and the only family resemblance is that 
they are both largely electronic and do computing. 
However, each has its place in the general scheme 
of things and can do some things better than the 
other. 

Therefore, it is time to end the artificial 
segregation and take steps leading toward a mar- 
riage of these strangers with the hope of cross- 
breeding them selectively for their dominant char- 
acteristics and gradually eliminating the objec- 
tionable recessives. Although several generations 
of computers may be necessary to achieve the de- 
sired paragon, now is the time to take the necessary 
steps toward the union and cross-pollination. 


The Characters Involved 

The principal characters in today's plot acre 
Don Digital and Ann Analog, strangers to each oth- 
er and only distantly related. Don is an aggres- 
sive young machine who is developing rapidly into 
a powerful force in his community because of his 
many fine characteristics. He has a quick mind, 
faster than any human being we know, and he is 
extremely accurate. He can read and write at fan- 
tastic rates, do lots of arithmetic rapidly, and 
rarely makes an error because of his clever ways 
of checking himself. However, at the time of our 
story he can handle only one thing at a time and 
has not learned how to do integration very rapidly. 

Don's talents have been extended to many 
fields of science, engineering, and industry where 
his dominant characteristics are invaluable. He 
is equally at home performing complex engineering 
and scientific confutations, or in processing tre- 
mendous masses of business and industrial data 
such as for Finance or Manufacturing. 

In other words, Don's is an exact mathematical 
mind and, even though he reads and writes all al- 
phabetic and algebraic characters as well as many 
punctuational and mathematical symbols, he thinks 
of all such symbols in the language of numbers and 
performs his amazing feats in this language. His 
is the language of numbers. 

Ann, on the other hand, is the more retiring 
type, and lacks Don's aggressiveness. She has al- 
ways been the physical type and, accordingly, has 
learned to understand only the physical things in 
life. As might be guessed, she knows what to do 
with curves. She is a clever mimic and can simu- 
late other physical things very cleverly but she 
can express her thoughts only in physical form, 
and converts these thoughts to numbers only very 
crudely and with some difficulty. Hers is the lan- 
guage of physical quantities and curves. 

This does not mean that her mind is not facile, 
because actually she can handle a number of vary- 
ing ideas simultaneously and, with almost feminine 
intuition, arrive at an answer. Her mind is agile 
too, and she can arrive at a conclusion based on a 
number of varying facts almost instantaneously, 
although the answer may be somewhat approximate. 

Accordingly, Ann's forte is obtaining approx- 
imate answers instantly, performing complex jobs 
like integration instantly, and coming up with an 
answer to what happens when a lot of inputs are 
varying simultaneously or, as the engineers and 
scientists say, when there are several "degrees of 
freedom". 
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The Play Unfolds (in synoptic form) 

Prologue, Ann and Don have talked only on 
rare occasions because of the language barriers and 
with not too much success. It is difficult to carry 
on any kind of a courtship or even an intelligent 
conversation with a third person translating each 
statement and answer. In addition, it makes the 
courtship a long drawn out affair when each must 
sit idle and wait for the translator to convert onefe 
statements into a language understandable to the 
other. 

However, Dan Cupid will not be outdone and so, 
with the aid of the author, a new element is injec- 
ted into the plot — a new means of communication 
by which Don can speak to Ann, These, then, are 
the characters that we hope to bring together to- 
day, at least in the courtship stage, through a 
new proposed language we have named "Anatran". 

Act I — The Courtship, During the past few 
years, Don Digital has been learning some new meth- 
ods of translating the thoughts and symbols of homo 
sapiens into the only language Don understands so 
that he can take over part of this translation that 
used to be so frustrating when it was necessary to 
depend upon slow, inaccurate humans. At this time, 
he is capable of accepting an almost completely hu- 
man language and converting it, at his own rapid 
and accurate rate, to his own dialect so that he 
can handle the problem at his own fantastic speed 
and accuracy. 

However, there is one drawback — there must 
be some relationship between human language and the 
slightly pidginized computerese that is the pre- 
sent communication link between man and machine. 
For example, Don, in his 70U form, accepts Fortran] 
in his Uhivac form he accepts Mathmaticj and so on. 

Now, a new computerese titled "Anatran” has 
been conceived and is being studied so that Don can 
dialect, translate it into a sort of love letter — 
the analog flow diagram and set-up data that Ann’s 
attendants can use directly for Ann, This includes 
her pot settings, her wiring flow diagram, the num- 
ber of pots and linear and non-linear elements she 
needs, and even an easy means for the attendants to 
check that Don has not said anything inadvertent 
in his missive. 

At this point in Act I, Don's attendants, who 
have been studying analogese, have developed a sim- 
ple and easy means of producing the missive that 
will convey all this information to Ann's attend- 
ants for her consumption. Having developed the in- 
put and output of the new language, Anatran, Don’s 
attendants are now looking into the actual writing 
of the vocabulary and rules of syntax. (See the 
author's footnotes and comments for details of the 
actual missive.) 

It should be noted that, in the approved man- 
ner of courtship, Don is the aggressive one and not 
only takes the lead in communicating with Ann, but 
also controls Ann's actions. 


Act II — The Engagement. Inpatient and chaf- 
ing at the slowness of Ann’s attendants in trans- 
lating Don's missive for Ann, the idea is conceived 
of adding to Don's missive another communication 
medium in the form of a paper tape or magnetic tape 
on which all of the pot settings are listed. 

With the help of an automatic pot setter, all 
of Ann's pots are set automatically in a matter of 
minutes instead of the hours formerly consumed by 
her attendants. Thus a physical bond is establish- 
ed in which Don not only directs Ann's actions but 
actually furnishes to her a part of his own handi- 
work that enables her to perform her own marvelous 
feats more rapidly. 

Act III — The Marriage, There still exists 
one weak link in the union between Don and Ann — 
the slow, laborious task of wiring Ann's patch- 
boards from the instructions in Don's missive. With 
the assistance of a new technique of electronic 
switching and, perhaps, the help of some relays, 
means are found whereby the patchboard wiring by 
Ann's attendants can be eliminated. However, in- 
stead of relaying the instructions from this new 
automatic patchboard wiring system by means of the 
same paper or magnetic tapes, it is time for the 
actual marriage of these two, who have now learned 
to communicate and work together toward joint goals 
in life. 

Accordingly, a physical bond is made so that 
Don can send directly to Ann all of the instruc- 
tions that will not only set her pots automatically 
but also will perform all internal wiring automat- 
ically so that Ann is ready to perform her appoint- 
ed tasks quickly and efficiently with little lost 
time between jobs. As before, masterful Don "wears 
the pants" in the family and tells Ann what to do, 
because he is the stable and reliable one that can 
be depended upon to produce the same results time 
after time, whereas Ann, in true fendne fashion, 
feels that an approximate answer is good enough as 
long as she continues to handle the jobs for which 
she is best suited. 

It is an ideal union because Don is still free 
to do his tasks as he pleases without interference 
from Ann] he is free to call upon as much help from 
Ann as he needs to do those parts of his job for 
which she is best suited] and with it all, he can 
direct Ann and assist her in setting up and per- 
forming those jobs on which she is best suited to 
work alone. Would that all families in homo sapiens 
might emulate this marital status] 

Epilogue. The marriage of Don Digital and 
Ann Analog has brought together finally these two 
classes of machina sapiens into a unified system 
that exploits the best features of each of the in- 
dividual members but it still is a union of two 
existing systems and is not a single system de- 
signed specifically to exploit only the best fea- 
tures of both systems and eliminate those features 
that are less desirable or superfluous. 

Thus, the next generation will be the offspring 



that achieves this goal — the machine designed to 
perform either digital or analog operations or both 
in the most efficient manner possible. This is the 
"Diginalog". 

Footnotes and Comments 

The play is ended and our flight of fancy is 
over. However, the allegory contained therein is 
not one based on imaginary conditions or wishful 
thinking. It is founded upon actual facts coupled 
to new concepts that the author and his people at 
Northrop have developed. 

Retrospection 

Evolutionary Relationships. Probably one of 
the principal reasons that digital and analog com- 
puters have remained so far apart has been the his- 
torical development of these two classes of comput- 
ing equipment. Analogs, as the name implies, start- 
ed not as computers but as physical systems designed 
to simulate by analogy other physical systems whose 
characteristics and behavior under various condi- 
tions of operation were to be predicted. In this 
role, they served the engineering and scientific 
world very well, but their invasion into the math- 
ematical world always was limited by their very low 
accuracy and precision. Only with the advent of 
electromechanical and electronic equipment did the 
application of analog computers to the solution of 
engineering and scientific problems become of sig- 
nificant value. 

Meanwhile, digital computers, even in their 
earliest crude forms, have always had the advantage 
of accuracy but they have been comparatively slow. 
With the many advances in electromechanics and 
electronics, digital computers have advanced in 
speed and capability to the point that their uses 
are becoming increasingly widespread. However, 
there still have been two different classes of dig- 
ital computers developed — the calculators and the 
data processors, as indicated in Table I. The dif- 
ferences shown there in their characteristics re- 
sulted in differences in logical design and equip- 
ment. 

Fortunately, the continued use of calculators 
and data processors has led to the changes in char- 
acteristics indicated in Table II so that there no 
longer is any reason for the next generation of 
computers to have two different types of equipment 
for the two types of functions. We have now reach- 
ed that stage in the state of the art in which the 
next generation of computers and data processors 
will have one and the same machine performing both 
types of functions with equal facility. 

Earlier Marriages - Future Problem-Oriented 
Languajes (POL *8), Because of these different 
lines of development, the jurisdiction over the two 
classes of digital computers and over the analog 
computers has almost always been separate and un- 
related, with each type of equipment under the con- 
trol of a different group in the organization. 
Thus, there was little opportunity or incentive to 
bring these different machines together in any at- 
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tempt to improve the efficiency of solving problems 
by exploiting only the best features of each class 
of computer in a joint solution. In a few cases, 
small analog-to-digital (and vice versa) converters 
permitted a limrlted amount of intercommunication 
by digitizing analog information or interpolating 
discrete digital information into continuous form. 
Convair with its application of DATRAC is an exam- 
ple of interposing a translator. 

Ramo-Wooldridge with its large converter that 
connected the 1103A and the analog equipment, is 
another example of the form earlier attempts have 
taken in interconnecting physically these two class- 
es of equipment. However, the problems of such an 
approach to digital/analog operation are fairly ob- 
vious when neither equipment dominates the picture. 
Program interruption on the digital machine when 
called for by the analog is not always a desirable 
mode of operation nor is it usually economical to 
let either class of equipment stand idle, waiting 
for the other to be available. 

The new concepts proposed herein would not be 
possible were it not for the advent of the newer 
types of internally programmed electronic data pro- 
cessing machines (EDPM) that operate at high speeds 
and, particularly, those that permit the perform- 
ance in parallel of several functions simultaneous- 
ly. It would not have been possible either were it 
not for the ingenious development of the new POL’s 
and their digital translators that permit these 
languages to be converted for and accepted by a 
digital computer that speaks in a different dialect. 
The fast memory and high speed of the new EDPMmakB 
it possible for this equipment to accept POL's that 
express the problem in quasi-mathematical or engi- 
neering form and translate these into machine lan- 
guage for solution on that particular digital com- 
puter. 

Northrop 's Experience 

Fortunately, at Northrop, the basic philosophy 
of computing and data processing has taken a dif- 
ferent turn from the segregation described above. 
Located in the heart of the new Engineering, Sci- 
ence, and Research Building is the Computing and 
Datamation Center referred to as COMPADAC for sim- 
plicity. It includes a large analog installation 
with precision EAI equipment, an IBM Type 70UEDPM, 
the necessary EAM equipment for support of the 70U, 
and miscellaneous other equipment. (By December 
I960, the 70U will be replaced by an IBM Type 7090 
or a TRANSAC S-2000.) Its functions and responsi- 
bilities run the entire gamut of computing and data 
processing including digital and analog calculation, 
analog simulation and model testing, engineering 
and flight test data reduction, business data pro- 
cessing, reconnaissance/intelligence data handling, 
and even a moderate amount of research and devel- 
opment. Because these various functions have been 
centralized under one authority, it has been possi- 
ble to enforce a greater integration of the various 
types of functions with a resulting improvement in 
efficiency and capability. 

For example, every former analog analyst has 
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been thoroughly trained in digital programming, and 
every digital programmer has been trained in the 
programming and use of analog equipment so that 
there exists no language or specialization barrier 
between the people who are responsible for these 
two classes of computing equipment. The only pro- 
blem, then, is one of having the programmer decide 
which equipment to use for the solution of a parti- 
cular problem and how best to get the answers rap- 
idly and economically. 

This has not always been so, and the author 
can remember the time when the rivalry between 
these two factions was great. It even crept into 
the lunch hours when digital and analog programmers 
played chess and proudly announced to the author as 
he went by, "SeeJ analog vs. digital J". However, 
through the forced integration of these two fac- 
tions by the Chief of COMPADAC, this rivalry has 
been eliminated and the effort now is toward better 
use of the equipment, either independently or jointly. 

The latest development along this line has 
been the concept of a series of progressive steps 
leading toward a better integration of the two 
classes of equipment than has been achieved in the 
past by plain converters. This new concept will 
ultimately result in the "Diginalog" (which will 
combine the best features of each class of equip- 
ment without the superfluous or objectionable fea- 
tures), but it will not spring up full blown. Rath- 
er, it will develop as a series of logical steps 
commensurate with the current and changing state of 
the art. 

On To Diginalog 

Anatran - The First Step. The first of these 
steps (which is the basis for this paper) is the 
development of a problem-oriented language that will 
permit translation on a digital computer of the 
complete input information for an analog computer. 
This new language has been termed "Anatran* 1 (ANA log 
TRANslation) . 

The first step in developing Anatran was to 
determine the feasibility of producing an accept- 
able output from Anatran on existing equipment — 
a wiring flow diagram for the analog computer such 
that one of the programmers could wire a patchboard 
for the analog computer directly without further 
translation or interpretation. The conventional 
flow diagram produced by humans for the transition 
of the problem statement into patchboard wiring is 
typically a symbolic wiring diagram involving cir- 
cles, triangles, rectangles, etc., as indicated in 
Fig. 3. To produce such a diagram on current EDPM 
would involve a number of technical problems such 
as the introduction of new print characters on the 
output printer and the storage of the complete dia- 
gram with interconnecting lines while selecting in- 
dividual and unrelated areas for printing one line 
at a time, as most digital output printers do. 

Accordingly, a new type of wiring flow diagram 
was developed based upon the dc operational ampli- 
fier or the non-linear element as the fundamental 
unit. In this new flow diagram format, the print- 


out format is arranged to include the inputs and 
outputs of each such basic element as a complete 
entity with that element, as indicated in Fig. b. 
This will become clearer as we follow the details 
of the new type flow diagram and compare it with 
the earlier form shown in Fig. 3. 

Concurrent with the development of the new 
format, it was found possible to include on this 
wiring flow diagram a complete list of all poten- 
tiometer settings in numerical order so that the 
diagram can also be used for manually setting the 
pots on the analog equipment. In addition, by in- 
cluding the origin of all inputs and the destina- 
tion of all outputs, this same diagram becomes the 
means whereby the programmer or analyst can check 
the accuracy of the formalized flow diagram pre- 
pared by the digital computer. This will become 
clearer as we follow the diagram in detail. Final- 
ly, the diagram also provides a notation of the 
total number of potentiometers and linear and non- 
linear elements required by the problem. 

A Typical Problem. Now let us look at a typ- 
ical problem to be solved on an analog computer — 
that of determining the stability of an aircraft , 
in steady horizontal flight, with and without an 
autopilot, as the tail is struck by a vertical gust 
of short duration. The equations are shown in 
Fig. 1 together with the coefficients and other 
data on the input parameters in Fig. 2. 

Normally, this problem would be programmed 
either by the originating engineer (if he were one 
of those trained by COMPADAC) or by one of C0MPADAC& 
programmers. From the mathematical statement of 
the problem, a flow diagram would be constructed 
showing the complete step-by-step solution of the 
problem and closing all loops. All potentiometer 
settings, scale factors, voltage levels, and ini- 
tial conditions would be determined and listed so 
that the potentiometers and other equipment could 
be set up for proper solution of the problem. A 
patchboard would be wired, checked, and put into a 
control console in the equipment. When all of the 
setup had been checked, the machine would be turned 
on for solution and the results recorded on six- 
channel strip chart recorders, changing input con- 
ditions as desired. 

Now let us refer again to Figs. 3 and b. The 
flow diagram in Fig. 3 is the manual product that 
will normally result from the engineer’s or pro- 
grammer’s work except for the fact that the equa- 
tions, pot numbers, amplifier numbers, and output 
destinations would not be shown normally on the 
diagram. The sophisticated programmer does not 
need such data on the diagram and they are shown 
here for the reader’s convenience in following, in 
detail, the conparison between Figs. 3 and b. The 
numbers within the circles are always gain factors 
while the numbers within the triangles are the 
voltage levels for the amplifiers and represent 
the number of volts per unit of the quantity pro- 
duced by 'the amplifier. 

Fig. b is the corresponding wiring flow dia- 
gram, listing of pot settings, check sheet, and 



notation of the number of pots and functional com- 
puting components required, as it will be produced 
by the proposed Anatran. This will be the final 
product of the 70b (or other EDPM) output printer. 
Once familiarity with the format has been realized, 
no headings are really necessary for the columns 
since their content is almost self -defining. How- 
ever, it is no problem for Anatran to produce the 
necessary headings over the various columns and 
over the various groups of columns that constitute 
a functional field on the diagram. Pre-printed 
paper is unnecessary and would be impractical if 
the analog problems were mixed in with the digital 
problems handled by the EDPM. 

The similarity between Figs. 3 and b may not 
be apparent on first observation but a little ex- 
planation should make the principles of Fig. It 
perfectly clear. We know this because we have 
tried out the new format on two different program- 
mers who were unacquainted with its development and 
who haul to translate its contents into patchboard 
wiring with only a brief explanation of the rules 
of format. One of these had originally been train- 
ed in digital programming and the other in analog, 
although each does both classes of work now. Each 
of these men found the format easy to grasp and 
simple to apply accurately. 

let us follow the flow diagram and tabular da- 
ta in Fig. I*, explaining the ground rules as we go. 
The first six columns list in sequence the inputs 
to ary amplifier or non-linear element in the form 
of a name for the quantity depicted, its source, 
and the various data connected with potentiometer 
settings and the input. The next two columns in- 
dicate the voltage level and type of analog compo- 
nent involved, together with its identifying number: 
In this case, the numbers are assigned serially 
while the letters are assigned according to the 
function of the analog component according to the 
code listed at the bottom of Fig. b, The last two 
columns indicate the output of the analog component 
by name and destination. 

The diagram is built up one component at a 
time, with the component name or identification 
letter and number listed oposite the last input to 
that component as indicated in Fig. b. Following 
the complete input and output story of any element, 
a line is skipped to indicate the change to the 
next element and the new inputs are listed. 

Notice that where no potentiometers are in- 
volved, such data is omitted. The components in 
Figs. 3 and U have been lettered and numbered iden- 
tically for exact comparison, and the reader can 
trace through the diagram shown in Fig. b by com- 
parison with the corresponding steps in Fig. 3. 

The reader will observe in Fig. It that the po- 
tentiometer numbers and settings are listed simply 
and clearly in columns 3 and b so that the diagram 
constitutes a list for the potsetting operation. 

He should also observe that the "component count" 
table at the bottom of Fig. I: indicates the number 
of elements required to solve that particular pro- 
blem. 
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The wiring can be accomplished by patching 
cords from the output destinations in the last col- 
umn to the corresponding input information in the 
input field. Notice, too, that this arrangement of 
a tabular form of flow diagram permits easy check- 
ing of the solution by the programmer to make sure 
that all loops are closed and all inputs and out- 
puts included. 

In order to extend these same principles de- 
veloped in this simpler form of stylized flow dia- 
gram to other analog components, comparable flow 
diagrams were developed for other problems involv- 
ing servo-multipliers, resolvers, ungrounded pots, 
and diode limiters. The problem statements, and 
corresponding manual diagram and EDPM-produced flow 
diagram are shown in subsequent Figs. 5 and 7. As 
the reader compares the corresponding diagrams, the 
extension and application of the same principles 
described before will become self-evident. 

Having determined that it is feasible to pro- 
duce on existing EDPM (without modifying that equip- 
ment) a suitable flow diagram, listing of pot set- 
tings, easy checking diagrams, and a notation of 
the number of components required, the next step is 
the development of the vocabulary and syntax re- 
quired to translate engineering and quasi-mathemat- 
ical statements such as shown in Figs. 1 and 5 into 
the output that has been shown in Figs. U and 7. 

The development of the Anatran translator program 
itself is our next goal. This is the "courtship" 
of our allegory. 

Mechanization. The next step in automatizing 
the setup of general purpose analog equipment by 
use of a digital computer requires no new digital 
equipment and is also commensurate with the state 
of the art. Currently, the manual setting of po- 
tentiometers (which is a laborious task that is 
time consuming and easily subject to errors) is 
being replaced at Northrop by an automatic potset- 
ting device employing punched paper tape as its 
input information source. As the digital computer 
computes the potsettings, scale factors, gains, and 
voltage levels, and develops the end product shown 
in Figs. U and 7, it is possible for it to produce 
punched cards or punched paper tape that contain 
just the potsetting information. Cards can be con- 
verted into punched paper tape (or even magnetic 
tape) format so that the tape can be fed to the au- 
tomatic potsetting device and thereby eliminate the 
manual step of setting pots. This is the "engage- 
ment" referred to in our allegory. 

Means already have been developed for elimi- 
nating the analog patchboard wiring by the use of 
automatic crossbar switching, but such equipment is 
very large and expensive and difficult to justify. 
The author and his people have ideas about the pos- 
sibility of electronic switching devices in matrix 
arrangement (with or without relays) that will per- 
form the same functions as patchboard wiring more 
rapidly and less expensively, but this will be the 
subject of a later paper. This is the "marriage" 
referred to in our allegoiy. 

When this marriage is consummated, it will be 
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possible for the digital computer to determine the 
entire course of the solution of a problem on both 
digital and analog equipment. If the problem is 
one for which the digital machine is best suited, 
it can solve the problem without recourse to the 
analog, but it can, meanwhile, set up the analog 
equipment to solve problems that are best suited 
to analog solution while it is doing its own work. 
Thus, Doth kinds of equipment will be kept busy un- 
der the control of the digital machine. Problems 
that the analog portion of the united equipment 
might solve would include "quick and dirty" solu- 
tions that do not warrant greater accuracy and pre- 
cision, parametric studies that involve many cases, 
and multiple degree of freedom problems. 

However, in those cases in which the problem 
calls for integration or some of the other functions 
peculiarly suited to analog equipment, the digital 
machine can perform its own part of the solution 
and call upon the analog to do the integration or 
other function when and if required. 

Extrapolation 

One can extrapolate the principles of this 
united digital/analog computer into the "Diginalog" 
in which the logic of the machine is such that only 
the best features of each class of computing equip- 
ment are exploited and all other superfluous or re- 
dundant features are eliminated. 

Of course, with crossbar switching, vacuum 
tube or relay switching, or matrix electronic 
switching, it is possible to retain the individual 
identity of the digital and analog machines as des- 
cribed in the "engagement" stage above, by utiliz- 
ing punched paper tape or magnetic tape output from 
the 70 U as the input information for controlling 
the automatic patchboard wiring device. Such a 
technique would postpone the "marriage" if the lat- 
ter were uneconomical compared to the cost of the 
added manual step in moving tapes from the EDPM to 
the analog machine. 

Likewise, the digital differential analyzer 
(DDA) can perform some of the desirable tasks of 
the conventional analog with dc operational ampli- 
fiers while improving the accuracy and repeatabil- 
ity of the solution. Therefore, it might be equally 
desirable to develop the interconnection first by 
use and later by hardware of EDPM and DDA equipment 
much as has been described in the text above. The 
principles of developing the interrelation will be 
the same. 
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TABLE I 


THE FIELD OF COMPUTERS 


Prior to 1959-1960 


Amount of 
Data Handled 

Amount of 
Calculating 


Usual Type 
Usual Size 


Calculators 

Data Handlers 

Simulators and 
Model Testers 

Controls 

Small 

Large 

Varies 

Varies 

Large 

Small 

Varies 

Varies 

Digital and 
Analog 

Digital 

Analog 

Analog and 
Digital 

Small to 


Small to 

Small to 

Large 

Large 

Large 

Large 


TABLE II 


THE FIELD OF COMPUTERS 


After 1959-1960 


Amount of 
Data Handled 

Amount of 
Calculating 


Usual Type 
Usual Size 


Calculators 

Data Handlers 

Simulators and 
Model Testers 

Controls 

Large 

Large 

Varies 

Varies 

Large 

Large 

Varies 

Varies 

Digital and 
Analog 

Digital 

Analog 

Analog or 
Digital 

Small to 


Small to 

Small to 

Large 

Large 

Large 

Large 


(For Interpretation Of These Tables, Refer To Text) 
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longitudinal Motion Of An Aircraft With 
And Without Control By An Autopilot 


Equations Of Motion 

u * -Au -Bw -B0 

w » -Du -Ew +G© -H2> 

0 - -Ju -Kw -Lw -MB -NS +F X 

Autopilot Equation 

1 - +P9 +R0 

Load Factor Equation 

An » S0 -Tw 

Altitude Equation 

h - -V0 +Yw 

Definition Of Symbols 

u - Forward velocity of the aircraft 
u ■ Forward acceleration of aircraft 
w ■ Vertical velocity of the aircraft 
w = Vertical acceleration of aircraft 
0 ■ Pitch angle of the aircraft 


(This is the gust disturbance of forcing function) 


5 ■ Total control surface deflection 

h = Altitude of the aircraft 

n » Load factor 

Ft = An input disturbance to pitch accelera- 

tion 


Notation 

Capital letters A, B, represent constants that will be determined and put into the 

equations for each set of conditions in the problem. 

The single dot over a symbol represents the rate of change of the quantity represented by that 
symbol j e.g., u « rate of change of the velocity u. 

The double dot over a symbol represents the rate of change of the quantity represented by the 
single dotted symbol j e.g., e « rate of change of the "rate of change” of the pitch angle 0. 


MATHEMATICAL STATEMENT 
OF A TYPICAL PROBLEM 


FIG. 1 



Pot Set Scaling With Positive 


COEFFICIENT 

FACTORS 

Symbol 

Value 

Scale 

Gain 



(Voltage) 

(Resistor) 

A 

.0059 

1 

0.1 

B 

.0366 

1 

0.1 

C 

32.20 

100 

0.5 

D 

.1030 

1 

1 

E 

.kk80 

1 

1 

G 

536.0 

100 

10 

H 

27.50 

100 

1 

J 

.oook5o 

0.01 

0.1 

K 

.0122 

0.01 

5 

L 

.0000037 

0.01 

o.o5 

M 

.3500 

1 

5 

N 

1.270 

1 

2 

P 

1.500 

1 

10 

R 

1.500 

1 

10 

S 

17.6k 

100 

0.2 

T 

0.031 

1 

0.1 

V 

536.0 

1000 

1 


SET UP OF TYPICAL ANALOG 


POTENTIOMETER 
Identity Setting 

1 .0590 

2 .3660 

3 ,6UUo 

h .1030 

5 .Ub 80 

6 .5360 

7 .2750 

8 .k5oo 

9 .2kOO 

10 .007k 

11 .0700 

12 .6350 

13 .1500 

lk .1500 

15 .8320 

16 .3110 


17 


5360 








































INPUT 


AM?LIFIEF OP NON- 
LI NEAP ELEMENTS 

OUTPUT 

DU ANT I TV 

SOUF'CE 

POTENT I ONE TEP 

NO . SETTING GAIN 

SC RLE 

FfiCTDP 

UOLTftGE ELEMENT 

leuel AND NO. 

QUANTITY DESTINATION 


-u 

I 1 

1 

0590 

0. 1 1 





-W 

S 4 

2 

3 6 b 0 

0. 1 1 





-THETA 

I 7 

3 

6t40 

0.5 100 

1 I 

1 

-u 

PI , F'4, P8/IPC 1 

-U 

I 1 

4 

1 030 

1 1 





-Ijl 

S 4 

cr 

4 480 

1 1 





+THETAD0T 

S 6 

■s 

5 38 0 

1 0 1 00 





-DELTA 

s 9 

7 

2750 

1 1 00 

1 S 

- 

-WDOT 

I3.P10.P16 

-UID07 

S 2 



1 

1 I 

3 

+W 

Sh. 1 1 1,6 PC 1 .. 2 : 

+ UI 

I 3 



1 

1 S 

4 

-W 

P2.P5.P9 

-U 

I 1 

g 

4 500 

0.1 0.01 





-w 

S 4 

Q 

244 0 

5 0 . 0 1 





-III dot 

S 2 

1 G 

0074 

0 . 0 5 0 . 0 1 





-THETADOT 

T =; 

1 1 

0700 

5 1 





-DELTA 

S 9 

1 2 

6 350 

2 1 





FI 




1 1 

1 00 I 

5 

-THETADOT 

P 1 1 .. S6 

-THETADOT 

I 5 



1 

1 00 s 

6 

+THETADOT 

P6 .. I7.P1h.P1I 

+THETAD0T 

S 6 



1 

1 00 I 

7 

-THETA 

F' 3 . S8 , F 1 7 

-THETA 

I 7 



1 

1 00 s 

8 

4THETA 

PI 3. CPC 1 . 39 

+THETA 

s e 

1 3 

1 500 

10 1 





+THETAD0T 

S 6 

1 4 

1 500 

10 1 

1 00 s 

y 

-DELTA 

F'7,F'12 

+THETA00T 

S 6 

1 5 

8 320 

0.2 100 





-WDOT 

5 - 

1 6 

3110 

0. 1 1 

1 s 

1 0 

-DELTAN 

CPC 1.49 

-THETA 

I 7 

1 7 

5 360 

1 1 000 





+W 

I 3 



0.1 1 0 

0. 1 I 

1 1 

+H 

CPC 1 . 59 


POTENT I OMETEP 


AMPLIFIERS M LI L ■ T I P L I E P S 


PE SOLO EPS 

PEC OF DEF'S 



1 7 


1 1 

0 


0 

i 


LEGEND: 


An 

■ High Gain Amplifier (n»no. ) 

NLPnt - 

In 

■ Integrator (n=no.) 

Pn 

Mn 

« Servo Multiplier (n=*no.) 

(RCn,c) » 

Sn 

* Summer or Sign Reverser (n»no.) 

Rn,c - 



Figure h 


Non-Linear Potentiometer (n=no. ,t«tap(T-top,C«center,B=>bottom) ) 
Potentiometer (n=no.) 

Recorder (n=no. ,c**channel) 

Resolver (n»no.,c=output channel (1 or 2)) 


ANATRAN OUTPUT 


(Wiring block diagram, list of pot settings, checking diagram and 
component count. Produced on 70U or other EDFM output printer.) 
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2 4 6 8 

R - RANGE 

(THOUSANDS OF FEET) 


V T * TARGET VELOCITY 
V M - MISSILE VELOCITY 
6 - LEAD ANGLE 
R - RANGE RATE 

0 - TARGET ANGLE WITH RESPECT TO RANGE 

EQUATIONS: 

R - V T COS0- V M COS 0 
0 * (V M SIN 0 - V T SIN 0 )/R 
CASE 1: VOLTAGE LEVELS: 


V T - 500 FT/SEC 
V M - 700 FT/SEC 

R » 4000 FT 
o 

0 - 0.5 RADIAN 

o 


V T O.lv/UNIT 
V M O.lv/UNIT 
R O.Olv/UNIT 
R O.lv/UNIT 


0 0.5v/DEGREES 


0 lOOv/RADIAN 


Figure 5 

A COLLISION COURSE SIMULATION PROBLEM 



Figure 6 

ANALOG BLOCK DIAGRAM - MANUALLY PRODUCED 
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NON-LINEAR POT; 

TOP, CENTER, BOTTOM 


















INPUT 


OUTPUT 


AMPLIFIER OR NON- 
LINEAR ELEMENTS 


OUANT I TV 

SOURCE 

POTENTIOMETER 
NO. SETTING 

SCALE 

GAIN FACTOR 

VOLTAGE 

LEVEL 

ELEMENT 
AND NO. 

QU ANTITV 

DESTINATION 

+ 1 00 

REFVOLT 

1 7000 

1 0.001 

0. 1 

S 1 

-VSUBM 

NLP IB, S2 

-VSUBM 

S 1 


1 

0. 1 

S 2 

+ VSUBM 

NLP IT 

+0.5THETA 

FG 1 







+ VSUBM 

S 2 

NLP IT 



R 1 . 1 

-VMCOSTHETA 

S3 

-VSUBM 

S 1 

NLP I B 



P 1.2 

-VMS INTHETA 

S5 

-VMCOSTHETA 

p 1 . 1 


1 





+ V rCOSF'H I 

R 2, 1 


1 

0. 1 

S 3 

-ROOT 

14 

-ROOT 

S 3 


0. 1 





- 1 00 

REFVOLT 

2 4000 

IC 0.0001 

0 . 0 1 

I 4 

+P 

FG 1 , M3 CRC 1,19 

-VMS INTHETA 

P 1,2 


1 

0. 1 

S 5 

+VMSINTHETA 

A9 

+ 1 00 

REFVOLT 

•3 5000 

1 0.001 

0. 1 

S 6 

-VSUBT 

NLP2B , S7 

-VSUBT 

S 6 


1 

0. 1 

S 7 

+VSUBT 

NLP2T 

+PHI 

I 10 

4 2865 

0. 005 


R 2, 1 

-VTCOSPH I 

S8 






R 2,2 

-VTSINPHI 

A 9 

-VTCOSPH I 



1 

0. 1 

S 8 

+VTCQSPHI 

S3 

+ R 

I 4 







-PHIDOT 

A 9 

NLP3T 






+ 0 


NLP3C 






OPEN 


NLP3B 



M 3 

-RPHIDOT 

A 9 

+ VMS INTHETA 

S 5 


1 





-VTSINPHI 

P 2,2 


1 





-PPH I DOT 

M 3 


1 

1 000. 

A 9 

-PHIDOT 

NLP3T , 110 

-PHIDOT 

A 9 


0. 1 





- 1 00 

REFVOLT 

5 5000 

IC 1 

100. 

I 10 

+PH I 

P4 


POTENTIOMETERS 

AMPLIFIERS 

MULT I PL I 

ERS 

RESOLVERS 

RECORDERS 


tr 


1 0 

1 


2 

1 


An 

LEGEND: * n 
Mn 

Sn 


High Gain Amplifier (n»no. ) 
Integrator (n»no. ) 

Servo Multiplier (n«no.) 

Summer or Sign Reverser (n«no.) 


NLPnt • Non-Linear Potentiometer (n-no.,t-tap(T-top,C-center,B-bottom)) 

Pn * Potentiometer (n-no.) 

(RCn,c) - Recorder (n-no., c-channel) 

Rn,c - Resolver (n-no.,c-output channel (1 or 2)) 


Figure 7 
ANATRAN OUTHJT 

(Wiring block diagram, list of pot settings, checking diagram and 
component count. Produced on 70li or other ED PM output printer.) 


(Compare with Fig. 6) 
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MAN-TO-MACHINE COMMUNICATION AND AUTOMATIC CODE TRANSLATION 


A. W. Holt and W. J. Turanski* 


Summary 

The ACT system is a programmed adjunct to 
a general purpose computing machine (currently 
considered for the MOBIDIC )whose purpose is to 
facilitate the initial encoding and subsequent 
application of specific "code-to-code transla- 
tion n procedures. The phrase "code-to-code 
translation" is primarily meant to cover the 
conversion of problem oriented pseudo-codes in- 
to machine code equivalents. The ACT system 
provides for the "housing" of many distinct 
translation procedures - carrying algebraic, 
data-processing, simulation languages, etc., 
into any of a variety of computer codes - with- 
in the bounds of a single controlling system. 

The programmed components of the ACT sys- 
tem include: 

1. An allocation interpreter - AI 

2. A "core library" of basic translation 
functions 

3. A general translation library whose 
content, varies with use of the system. 

The allocation interpreter permits the 
formation of large translation programs out of 
large (and small) pre-stored sub-programs. 
Without greatly impeding computation speed, AI 
determines internal and external storage allo- 
cations for all space-taking portions of infor- 
mation as part of problem computation. This 
facility is shown to be essential to the con- 
struction of ACT, and perhaps widely useful in 
other types of programs. 

The core library contains translation 
functions which play a fundamental role in a 
wide range of code-to-code translations. In- 
cluded are many functions which usually play 
an important part in machine assembly programs. 
Other functions provide for general library 
handling, code reordering, etc. 

Also associated with ACT is a writing con- 
vention called "canonical form" which governs 
the presentation of data (i.e. , programs in 
source code submitted for translation). Ca- 
nonical form, while flexible enough to permit 
highly diverse problem codes, still standardizes 
all those signals which the basic ACT components 
decode. 


The ideas presented herein were developed 
at the University of Pennsylvania under contract 
to the U.S. Army Signal Corps in connection 
with the FIELDATA family of data-processing 
equipment. Although reference is made to the 
FIELDATA family in the presentation below, the 
ideas should be applicable to other families 
of equipment. Because of the prevailing ten- 
dency towards modular design in computing sys- 
tems, such families are rapidly becoming more 
numerous. 1*2,3 

General Considerations 

The Problem 

The ACT system, is designed to reduce the 
man-to-machine communication problem in oper- 
ating the Army FIELDATA system. The fundamen- 
tal assumptions with regard to this communi- 
cation problem are: 

1. that the FIELDATA system employs a 
variety of different digital computers } 

2. that the computational problems to 
be solved with these machines belong to many 
different "problem areas" - e.g., trajectory 
computation, estimation of statistical param- 
eters, information retrieval, language trans- 
la ti on j etc. 

3. that the exigencies of time and cir- 
cumstance may make it necessary to run any of 
the intended range of problems on almost any 
of the intended range of machines ; 

1*. that the overwhelming majority of 
Army personnel concerned with these problems 
will not be computer programming experts for 
any (let alone all) of the intended computers} 
and 

5. that the efficiency of the entire 
FIELDATA system hinges primarily on the speed 
of problem solution - i.e., the total "lapsed 
time" from problem statement till solution de- 
livery. Therefore, the speed with which a prob- 
lem can be correctly encoded for computer 
solution may matter a good deal more than the 


* The work described in this paper was done for the U.S. Army Signal Research and Development Labo- 
ratory, Fort Monmouth, New Jersey, as part of a "multiple task" project at the University of Penn- 
sylvania. The aurhors were engaged in this work by sub -contract of Remington Rand Uni vac, Phila- 
delphia, Pa., to the "Common Programming Language" task of the above project. 
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efficiency of the machine program with which the 
computation is finally performed. 

Existing Techniques 

The man-to-machine communication problem 
has been widely recognized and seriously attack- 
ed in many of its special instances. Most of 
the present day "automatic programming systems" 
are the result of manufacturers ' attempts to 
narrow the gap between "the problem" , which ex- 
ists in the minds of technical specialists, 
and "the program" which "causes" the computer 
to compute a solution. The invention and con- 
struction of an automatic programming system 
normally involves two major steps: 

1. The invention of a special problem code 
(often referred to as a "pseudo-code" )which: 

(a) is compatible with already es- 
tablished habits of expression within some 
technical specialty, and 

(b) is sufficiently formal and complete 
to permit automatic conversion - or "transla- 
tion" - into a computer -coded program. 

2. The construction of a special trans- 
lating routine which accepts as input data 
problems represented in problem code and pro- 
duces as output corresponding machine programs. 

Most commonly, the translating routine is 
designed to accomplish its entire task before 
computer solution of the stated problem begins. 
Thus, bringing a problem to solution with the 
help of an automatic programming system is 
typically a two-stage process - translation 
followed by computation. (See figure 1). 

Automatic programming systems which func- 
tion as in Figure 1 depend upon three codes: 

S, V, T. They are usually produced by manufac- 
turers; as adjuncts to particular computers. 

Thus, "Computer V" and "Computer T" of 
Figure 1 are almost always one and the same. 

If , for example , the manufacturer is putting 
a "scientific machine" on the market, he will 
usually design an algebraic-mathematical pseudo- 
code to go with it, and he will prepare the 
translating routine to produce programs for 
itself on that computer. 

Past experience with the building of auto- 
matic programming systems has brought to light 
several facts which have a strong bearing on 
the FIELDATA man-to-machine communication prob- 
lem: 

1. To produce an automatic programming 
system (S,V,T) normally requires major pro- 
gramming effort - measured in tens of man-years. 

2. Such systems are usually very difficult 
to modify in response to changes in the specifi- 
cations for S, V or T. This difficulty arises 
because the translating routine, within which 
the specifications for S, V, and T are "frozen", 
is usually a large computer program from which 
it is impossible to 'factor 1 the components, 
S,V,T. 

3. As a consequence of the immediately 
preceding point, it is further true that a 
translating routine which has been programmed 
for the triple (Si, Vi, Ti), does not give a 


substantial head start for producing the routine 
for (Si, Vi, T 2 ) or for (S 2 , Vi, Ti). 

FIELDATA Requirements 

The requirements for FIELDATA computing, 
as set forth at the beginning of this intro- 
duction, make it clear that: 

1. The operation of the FIELDATA compu- 
ting facility will require a large variety of 
source codes available for easy use in a rich 
variety of technical specialties. 

2. A given source code will have to be 
translatable into any one of several target 
codes. 

Some more detailed discussion of both 
points follows. 

Problem Code Flexibility 

It is a well known fact that each techni- 
cal field has associated technical "jargon" - 
special vocabulary as well as special forms of 
expression - which belongs to it and it alone. 
This specialization of language resembles the 
specialization of computers for particular pur- 
poses. In general, special computer problem 
codes for statisticians, physicists, psycholo- 
gists, business administrators, operations 
researchers, etc., have not been developed be- 
cause the art of automatic programming is young 
aid the building of new systems very expensive. 

If the cost of translating routines were not 
an issue, it would* be desirable to provide each 
technical field with coding conventions which 
most nearly suit its established habits of ex- 
pression. 

Quite apart from making provisions for 
presently fixed habits of expression, the cre- 
ator of a translating routine must also note 
that habits of expression change with time 
and circumstance. As the constellations of 
problems to be solved undergo change, so do 
the related language habits. Old words are put 
to new uses \ new words are invented ; abbrevi- 
ations are created to refer to oft-repeated 
combinations } previously disjoint sets of nota- 
tions must be combined, etc. Thus, expression 
forms change in response to changing needs. 

In many important cases the expressions lead 
to computable problems. The speed with which 
answers appear depends heavily on the amount and 
kind of hand '"recoding"’ (or "encoding") of the 
original problem statement before the computer 
can "take over" and ultimately execute the prop- 
er sequence of computer steps to achieve a prob- 
lem solution. 

Computers - Variety and Compatability 

No matter how similar two computers may 
seem, they always turn out to be distressingly 
different in relation to some problems. Consid- 
er, for example, two computers, C and 2C, which 
differ from each other in no respects other than 
that 2C has twice the internal storage capacity 
of C. For all those problems easily accomplished 



by a single run on C, Computers C and 2C are 
fully equivalent. For those problems which in- 
volve more than one run on C - or perhaps the 
use of overlays - Computer 2C is equivalent to 
Computer C in only the sense that those origi- 
nally programmed for C will also run on 2C. 

Thus, for such problems, a major task of con- 
version from one computer to another may be en- 
countered if 

1. the problem was originally programmed 
for 2C, or 

2. the problem was originally programmed 
for C but must be made to run efficiently on 
2C. 

So-called "compatibility" between machines (in 
the sense of very similar or identical instruc- 
tion codes) does not guarantee ease of program 
conversion from one computer to another. Only 
small problems will convert re a dily, while big 
problems will offer difficulties not much less 
formidable than when "incompatible" machines 
are considered. Although the FI ELD AT A computers 
are being designed with mutual compatibility 
in mind, significant differences between one 
translating routine (S, V, T^) and another 
( S, V, T 2 ) must be expected. 

The need for many source codes and the in- 
clusion of many machines (and many configurations 
for each machine) in the FIELDATA system makes 
the normal automatic programming approach ec- 
onomically unfeasible. To go from s source 
codes to m machines patently requires s.m trans- 
lation routines - a prohibitively large number 
when the effort for each such translating rou- 
tine is considered. Even worse, if one had gone 
to the labor of constructing these s.m trans- 
lating routines, one would still be faced with 
an unmanageable chore in keeping them up-to- 
date in response to changing requirements.* 

Code-to-Code Translation 

The study of many problem oriented source 
codes and their translation into computer code 
reveals that various translation processes have 
many important common elements . Often commonness 
between translation processes can be found by 
examining their respective constituent parts. 

For example the translation of S^ to Tp - briefly 
(Si, Ti) - and the translation (S 2 , T 2 ) may both 
employ the same assembly system for the terminal 
portion of the translation. Sorting is a fre- 
quent common sub-process j at a finer level of 
analysis, table look-up, list forming, subdivid- 
ing a "sentence" into "words", economizing stor- 
age locations, etc., have frequent occurrence. 
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Another kind of commonness frequently found 
between two translation processes is the employ- 
ment of a common control sequence with differ- 
ences in regard to what is controlled. This is 
illustrated by generator systems (such as 
FLOWMATIC) in which the individual generators 
are properly regarded as sub -translators subject 
to an over-all controlling scheme. Two quite 
different pseudo-codes might be handled by the 
same controlling scheme with differences only 
in the library of generators being controlled. 

These examples indicate that code-to-code 
translation is coherent area of computation for 
which programming aids may be prepared. These 
programming aids (whose nature and form will 
be discussed below) wills 

1. Reduce by large factors the effort 
required to produce an operating program for the 
translation (S, V, T). 

2. Similarly reduce the effort required 
to modify existing translation programs in re- 
sponse to changed specification for S, V, or T. 

The ACT system is designed to be a pro- 
gramming aid such as discussed above. Once con- 
structed, it should have a far reaching effect 
on the manner and effectiveness of FIELDATA 
utilization. In working on some particular class 
of problems - or even just one large problem - 
programmers will find it convenient to refine 
their tools of program expression as they go - 
by modifying or adding to some previously a- 
vailable set of conventions. Thus code or 
"language" development will go hand in hand with 
problem solution, just as is normally the case 
in technical or scientific work without compu- 
ters. Furthermore the ACT system* should greatly 
facilitate the re-translation of a particular 
source code, originally translated to one ma- 
chine but now required for another. 


The Act System 

The ACT System is a collection of programmed 
devices which loosely fit together to help in the 
implementation of specific translations.** 

Also associated with the ACT System are two 
sets of writing conventions} one set governing 
the presentation of translation procedures - to 
be called "ACT translation procedures" - and the 
other to govern the presentation of data (i.e., 
source-coded programs submitted for translation.) 
This is analogous to computer systems, for which 
data and programing conventions must always be 
specified. A complete description of ACT will 
therefore include : 


* This is almost exactly the problem to which a proposal named UNCOL (sponsored by SHARE and described 
in ACM Communications , Vol. I Nos. 8, <?) is addressed. In our opinion the UNCOL proposal, while 
aimed in a valuable direction, is practically unworkable. For more details, see section on General 
Translation Library below, also a letter from George H. Mealy, Switching Systems Dept., Bell Tel. 

Lab., Inc., to the members of the UNCOL committee, dated Feb. 3, 1959. 

** Some of these components have importance and usefulness above and beyond their function within ACT. 
This is exactly comparable to parts of a computer (such as memory units or tape transports) which can be 
integrated into many different systems. 
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1. A description of major components. 

2. Writing Conventions governing ACT trans- 
lation procedures. 

3. Writing Conventions governing programs 
submitted for translation. 

The programmed components of the ACT system 
include : 

1. An allocation interpreter - AI 

2. A “core library” of basic translation 
functions 

3. A general translation library whose 
content varies with use of the system. 

Allocation Interpreter 

The allocation interpreter permits the for- 
mation of large translation programs out of 
large (and small) pre-stored sub-programs. Un- 
like assembly programs, it accomplishes its 
function dynamically - i.e., during the actual 
running of translation programs. This mode of 
operation is dictated by two important character- 
istics of translation programs: (a) frequent use 
of data dependent storages (b) the possibility 
of analyzing such programs into parts some of 
which are large sub-programs (unlike mathe- 
matical subroutines). 

Although AI participates at run-time , it 
does not cause any substantial reduction in 
computation speed. Almost all program ins- 
tructions are performed by the actual computer 
without interpretive interference. Only at 
infrequent intervals in program running time, 
when there is a change in requirements for 
internal or external storage space, is there 
a jump out of AI for bookkeeping and allocation 
activities, followed by a return of control to 
the running program, in this respect AI behaves 
in a manner analogous to that of contingency 
routines. 

The special jump -re turn instructions which 
interrupt the main computation for AI interven- 
tion might be conveniently viewed as new macro- 
instructions, added to the previously available 
stock of computer commands. These macro-in- 
structions are entirely concerned with space 
allocation, and programs written with these in- 
structions in mind have new formal properties. 
Correspondingly, a computer-plus -AI on which 
such programs are intended to run may be con- 
veniently regarded as an "Extended Machine" - 
EM. Figure 2 is based on this conception. 

The central cross represents the extension. 

Associated with EM is EM-code in which 
programs for EM must be stored. This code 
differs from ordinary machine code in several 
respects. 

1. The program is formally subdivided into 
program "phases", each phase headed by a phase 
description. The phase description lists 
"sequences" of "segments" of information be- 
longing to the phase and requiring space (in- 
ternal or external) when the phase operates. 

2. The phases of a program relate to each 
other through special calls (interpreted by AI). 
When a call for a new phase is encountered, the 
calling phase is suspended and the called phase 


initiated. When the called phase terminates it 
may either be held for re-use or dropped, de- 
pending on the manner of call. 

3. Aside from special AI macro instruc- 
tions having to do with space allocation (in- 
cluding new phase call) EM code contains or- 
dinary computer instructions with sequence- 
and-segment-relative addresses. These addresses 
are converted by a program loader into fixed 
form when the phase is first encountered during 
program operation. If a phase is held for re- 
use and then re-activated, it is returned to 
memory to the same set of locations to which 
it was originally assigned. Between uses of 
the same phase AI will have taken care not to 
assign any newly developed information which 
this phase requires to locations causing con- 
flict with old information already belonging 
to the phase. 

Although phases of EM-coded programs will 
normally be functional units (corresponding to 
a box on some flow-chart description), the con- 
verse need not be expected. The only use which 
is made by AI of the knowledge that a program 
Is subdivided into phases with particular inter- 
connections is to temporarily dump some infor- 
mation in favor of other information which is 
now required and which could not ‘otherwise be 
accomodated. Therefore functional units which, 
in their coded form, occupy very little space 
in the various computer media are not worth 
treating as separate phases. The bookkeeping 
involved in their handling is not worth the 
space return which they can bring. 

The power of EM may be said to derive from 
the following fact. In the case of EM programs 
the total space requirements of the program (for 
external and internal space) have been broken 
down into space requirements for individual pro- 
gram phases (where different phases may overlap 
by requiring the same information). At any one 
moment in computation time only one phase is op- 
erating and hence only its space requirements 
must be satisfied. Any parts of other phases 
which do not overlap with the active phase can 
be dumped and later restored if necessary. But 
the greater the amount of dumping and later re- 
storing the slower the program operates. If, 
therefore, there is enough memory space to per- 
mit the assignment of phases side-by-side the 
program will run correspondingly faster. In this 
way EM is able automatically to trade time for 
space depending on the total available storage 
of the computer and the sizes of various data 
lists which may have grown in course of compu- 
tation. 

The Core Library 

The core library consists of a collection 
of sub-programs - called translation procedures - 
which have near-universal importance in trans- 
lating pseudocodes into machine codes. (Many of 
these functions derive from a single fact about 
the relation of source to target: namely that 
the information in the source code is packaged 
by time of generation, while in the target code 
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it must be packaged by time of use)* 

The writing conventions - called "Canonical 
Form", abbreviated ”CF" - governing source- 
coded programs submitted for translation, are 
entirely related to the core library* All of the 
signals for which CF provides are decoded by one 
or several procedures in the core library. 
Conversely, all those translation procedures 
which only deal with signals specified by CF 
belong in the core library. The following 
is a partial list of functions fulfilled by 
the core library procedures. 

Coding Item Handling - Each CF item of pseu- 
do-code (such as a FORTRAN formula, an assembly 
instruction, a FLOWMATIC sentence, etc.) has a 
formal structure which provides for standard 
item features such as: (a) an item identification 
number (b) a symbolic item name (c) item type 
indications. In addition, after some preliminary 
editing, the body of the item may be recognized 
to consist of a sequence of symbols. The core 
library provides CF item handling functions which 
allow specific translation procesg.es to gain ac- 
cess to constituent portions of items by means 
of "item dissecting" commands. Similarly, if 
a translation procedure produces CF coding items 
as output, they may be assembled from their con- 
stituents by use of "item assembling" commands. 

Patch Interpretation - CF coding items may 
be generated (by programmer or generating rou- 
tine) in some order wholly different from the 
order in which they will be next required. Item 
rearrangement may be coded into the original item 
sequence by use of the item identification num- 
bers as well as by use of special "patch items". 
Core library functions will cause item re-order- 
ing, thus interpreting the "patch" intentions 
expressed in the original item sequence. 

Symbol In terpretation - By "symbol" is 
meant the smallest meaningful constituent of 
a coding item. ’Thus, the computer instruction 
•CLA B 5U2 * is composed of three symbols. Sym- 
ools may be of various types from the point of 
view of what interpretation they require during 
translation* Some symbols, for example, are 
absolute and require no interpretation; other 
symbols are ’dummy* and require replacement 
by specified egressions; yet other symbols 
refer to named coding items - i.e., are "ad- 
dresses" - and will require address interpre- 
tation. The core library contains a variety 
of procedures to effect symbol interpretation. 

Type Interpretation - CF coding items may 
contain marks to indicate their specific type. 
Thus, for example, a FORTRAN DO-statement and 
a dimension statement might be distinguished 
as to type; or a statement to be interpreted 
by FORTRAN as opposed to a statement to be 
interpreted by SAP may be distinguished by type. 
In general, type indications always mean that 
the items thus categorized, are to be respective- 
ly delivered to different translation procedures 
for treatment. Core procedures are used to de- 
liver CF coded items to appropriate translating 
routines in proper priority sequence, accord- 
ing to type indication. 


Call Interpretation - One of the frequent 
mechanisms employed in code-to-code translation 
is the adjoining of one or more batches of 
coding items, taken from a library of such 
"batches", to the coding items which already 
belong to the source coded routine being in- 
terpreted. A good example of this is the se- 
lection of subroutines from a library. After 
the subroutines have been picked up and (with 
some adjustments) adjoined to the routine un- 
dergoing translation, this latter routine has 
been 'expanded* by the inclusion of new coding 
items. Special items in the source coded rou- 
tine, called "call items" tell what particular 
batches are to be selected from the library for 
inclusion. 

In the case of subroutine selection, call 
interpretation may be carried out through a 
hierarchy of levels. The subroutines selected 
for adjoining to the calling routine may them- 
selves contain further call items requiring 
(for their interpretation) the selection of 
yet other library routines for inclusion. Thus 
the original source coded routine contains 
"first level" calls ; the subroutines thus se- 
lected may contain "second level" calls, and 
so forth, for nth level calls, h 

Call interpretation may be usefully applied 
to a variety of cases which have nothing to 
do with subroutine selection. For example, 
a routine coded in FLOWMATIC might be said 
to "call" on file descriptions from a library. 
The interpretation of such calls has much in 
common with subroutine selection. To go one 
step further: if there were a number of diff- 
erent files which all involved the same item 
description, then there might even be "second 
level" calls - the routines calling on file des- 
criptions, and the file descriptions calling on 
item descriptions. Core procedures are used 
to carry out hierarchic fetching and adjusting 
of library information. 

The General Translation Library 

The contents of the general translation li- 
brary are governed by the particular source 
codes and target codes for which translation 
procedures are desired. When stored in the 
translation library, these procedures are rep- 
resented in EM code - hence contain special ma- 
cro-instructions having to do with allocation. 
They also contain macro-instructions referring 
to translation procedures stored in the core 
library. In general, the time at which various 
core functions are brought into play in course 
of a translation is dependent upon features of 
source and target codes beyond those which are 
governed by canonical form. Hence the highest 
level control of a translation procedure always 
resides in a program belonging to the general 
translation library. With ACT, as presently 
conceived, there must be a distinct high-level 
controlling program for each s our ce -to-tar get 
pair (S, T) desired. Of course these controlling 
programs will, in general, be very short, the 
bulk of the work being encoded in sub-programs 
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belonging to general or core libraries.* 

One class of translation procedures expected 
in the library deserve special comment. In 
course of many code-to-code translations one 
reaches a penultimate code form which may be 
roughly described as "assembly code” and a final 
step in translation called ’’assembly". There 
is considerable variation between present-day 
translation systems (such as FORTRAN, FLOWMATIC, 
IT, etc.) in regard to the complexity of this 
final assembly phase. In the FLOWMATIC compiler 
for example, only the most rudimentary assembly 
phase, consisting of nothing more than relative 
address interpretation is employed ,5 while 
FORTRAN translation produces SAP assembly code 
with symbolic addressing, library reference 
ability, and many other features. 

In the development of ACT a special analysis 
was undertaken of the terminal translation steps 
to determine whether a widely applicable 'end 
game' could be prepared for inclusion in many 
different translation procedures. This analysis 
resulted in a set of specifications for assem- 
bly - system and code - which includes many 
features normally included in such systems but 
also some noteworthy additions. (Description 
of these lies beyond the scope of this paper.) 
Assembly procedures conforming to these speci- 
fications make extensive use of the core func- 
tions previously described as well as some 
additional ones. Since, however, such proce- 
dures always depend, in part, on the target 
computer, they are themselves represented in the 
general translation library* 

The success of the ACT system conception 
hinges critically on the feasibility of general 
translation library building, and we will now 
try to show that past experience with code-to- 
code translation justifies the expectations on 
which the over-all design of ACT is based. 

.It is well known that, in normal code trans- 
lation procedures, the source code is made to 
pass through some number of intermediate forms 
until, through final processing it becomes target 
code. 

(1) Cs — * C]q — *• C12 C13 ...... Cai-*-C<t 

In (1) the subscripts, II, 12, etc., are 
meant to suggest successive Intermediate forms, 
while A1 suggests Assembly form (A2, A3, etc., 
being possible intermediate forms through which 
the code passes during assembly). In FORTRAN 
translation, for example, the initial formula 
statements pass through several intermediate 
forms until they reach "triple" form j6 they are 
then translated further into SAP assembly code, 
etc. Generally speaking, as the code progresses 


from form to form, it becomes, step-wise, less 
and less conditioned by the specifications of 
the source code. Thus, for example, in FLOWMATIC, 
by the time the code has reached Op. file (1) 
form, it is largely independent of the detailed 
rules for FLOWMATIC statement writing. One 
could substitute many new conventions for 
FLOWMATIC sentence presentation, and correspond- 
ingly only alter the action of the so-called 
"glossaries" , which (together with some general 
controlling code) determine the first intermedi- 
ate code form. To put this another way: many 
details of FLOWMATIC code form are interpreted 
and done with in the first stages Cs — ► Cn. 

All such details could therefore be altered with- 
out requiring any revision in the compiler, save 
a change in the early phases. 

Exactly parallel observations apply to the 
terminal stages of code translation with regard 
to the details of the target code. Again 
choosing FLOWMATIC as an example: one can alter 
the target computer (hence the target code) 
with little or no effect on the early stages 
of translation which are source-code determined *-* 

Figure 3 shows a rough break-down of code- 
translation into stages which are successively 
more target-code determined and less source- 
code determined.**-* The division into gross 
stages as in Figure .3 is neither unique nor 
optimal. Its sole purpose is to show how the 
ACT translation library may be used to good ad- 
vantage. Suppose for example that a certain 
translation (Si, T^) is coded with exactly six 
intermediate stages, shown in Figure 3. The 
over all ACT program would read: 

(2) Do stage 1 
Do stage 2 


• 

Do stage 6 

This program would be stored in the ACT li- 
brary together with all six sub-programs which 
are called upon. Now suppose that one wishes 
to produce a translation for (Sj, Ti) where Sj 
differs from S-j_ only in its rules for operation 
and operand presentation (such as parenthesis 
notation for algebra replaced by Polish nota- 
tion; or the rules for file and item descriptions 
changed from "explicit" to "pictorial" form, 
etc.). The new translation may be obtained by 
writing a new controlling program 
(3) Do stage 1' 

Do stage 2 


Do stage 6 


* It is projected that a special component of ACT can be prepared, to be called the ACT "translation 
control" program. Highest level control of translation would then always be vested in this program, 
and the total number of individual translation procedures required to handle a large class of (S,T) 
would be greatly reduced. Further study is required to test the feasibility of this. 

** This observation is the basis for the so-called AIMACO system which translates FLOWMATIC code 
into 11@3 target code. 

-SHBt The fact that some computer effects enter the translation procedure almost from the beginning 
is what makes the UNCOL proposal - namely to accomplish all translations in two stages, the first 
source determined and the second target determined - unfeasible. 



and also coding a new sub-program corresponding 
to 1'. If, on the other hand, one wishes to 
produce a translation for (S^, Tj), one will 
have to consider how radically Tj differs from 
Tj • The more radical the difference, the great' 
er the number of intermediate stages which must 
be replaced. 

Consider next, the oroblem of producing a 
translation for (Sk, TjJ where Sk is a source 
code which allows parts to be written in Si 
and other parts in Sj with "type” labels to show 
which sections are written in accordance with 
which conventions. Suppose further that both 
translations (Si, Ti) as well as (Sj, Ti) were 
already available, and both coded in six stages 
as in Figure 3« The new control program would 
cause some number of initial stages for Si 
to be performed, and also some number of initial 
stages for Sj . As soon as both streams of 
coding have achieved a common form, they are, 
from there on in, submitted to the same terminal 
stages. How many stages would be involved be- 
fore the two streams converge is a function of 
how profoundly the two codes. Si and Sj, differ 
in their features as analyzed in Figure 3» 

These greatly over-simplified examples are 
meant to show that, because of the partial 
"■factoring" of source from target effects which 
one may realize in code-to-code translations, 
one will have ample opportunity for the re-use 
of major translation sub-programs to be stored 
in the ACT translation library. As already 
discussed previously, the extended machine base 
seems indispensible for the combining of such 
major sub-portions into entire operative 
programs. 
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tional requirements the object is determined ; 
the lower section tells by what coding forms 
the object is determined. (•EM-code* refers to 
the extended machine. In regard to the triplet 
(S,V,T) the machine used - extended or unexten- 
ded - is V. Because of the nature of EM, the 
ACT system can be transferred from one V to a- 
nother without recoding if the two machines diff- 
er only in configuration but not in order struc- 
ture. ) 
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Final Summary 

The primary aim of the ACT system is to 
reduce to a minimum the labor required to 
implement a given translation (Si, Vi, Ti). 

The reduction of labor is achieved through: 

1. Storing translational functions of 
almost universal applicability in the core li- 
brary aid providing for their incorporation in 
new translation programs. 

2. Making it possible for the translation 
library to grow in such a way that new programs 
(Sj, Tj) may use, as sub-programs, previously 
coded procedures already stored in the trans- 
lation library. 

Regarding item (2) above, its realization 
is entirely dependent on EM as the fundamental 
base on which ACT is built. The effictiveness 
of item (1), on the other hand, is dependent 
on the analysis of coding functions which leads 
to the definition qf canonical form. 

(Many of the representational facilities 
which CF provides will not be directly employed 
in programmer -written source codes, but will 
appear in intermediate code-forms which are the 
result of early editing and other translations.) 

Figure It, is a schematic summary of the 
three main portions of ACT. Each box in the 
figure is horizontally divided into three sec- 
tions. The middle section tells by what func- 
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THE COMPUTER OPERATION LANGUAGE 
George F. Ryckman 

General Motors Research Laboratories 
Warren, Michigan 


Summary 

The marry machine oriented and problem ori- 
ented languages existing and being planned today 
give rise to more complex problems in operating 
a computer. The development of computer operat- 
ing systems have materially aided the problem 
of getting a program or series of programs on 
and off the computer efficiently. The variety 
of language processors have created a new op- 
erating system function - that of interpreting 
a language describing the processing to be per- 
formed for a given job. In particular the lan- 
guage is used to call a given processor, per- 
scribe pertinent options and other control 
information. As such it is termed a computer 
operation language or operator oriented language. 
Not only does this language serve to free the 
human operator of many clerical tasks but It 
also provides for more consistent and error- 
free operation of the computer. 

Introduction 

In recent years the computing profession 
has recognized two classes of computer languages. 
The first of these is termed the machine oriented 
language. Characteristic of this language is 
that it is tied quite closely to the hardware 
itself. That is, coding in this language is 
either directly interpretable by the computer or 
an assembly process produces a language which is 
directly interpretable by the hardware. The as- 
sembly process then is working on a one-to-one 
translation basis. Examples of machine oriented 
languages are basic coding, relative coding, and 
probably the most used, symbolic coding. 

A second computer language is called a 
problem oriented language. It is characterized 
by having few. ties to the computer hardware. 
Rather it is aimed at problems in specific areas 
of mathematics, engineering, physics, accounting, 
and so on. Characteristic also of the problem 
oriented language is the compiler or generator 
which examines the source statements and gener- 
ates strings of basic machine code. The compiler 
is not only doing all of the basic machine cod- 
ing, but in many cases it is also doing some of 
the analysis and even furnishing numerical tech- 
niques. One of the more important differences 
between the machine oriented language and the 
problem oriented language is that it requires 
fewer pencil marks to state a problem in the 
problem oriented language. This in turn means 
less effort, less time, and less cost involved 
in preparing a problem in a problem oriented 
language. Still there is a need for the machine 


oriented language for those problems which are 
not conveniently expressed in any of the pre- 
sently prepared problem oriented languages. 
Examples of problem oriented languages are 
Flowmatic, Fortran, and the data processing 
generators. So much for the machine oriented 
language and problem oriented language. 

The Computer Operation Language 

I should like to introduce a third class 
of computer language. It is one which is known 
but probably not as universally recognized as 
the first two. We call this the computer op- 
eration language. Like the problem oriented 
language, the computer operation language has 
few ties to the actual hardware. Unlike the 
problem oriented language, it is aimed at in- 
forming the computer of the major steps to be 
taken in processing a job. The information 
supplied to the computer is much like the 
operator instructions written for the machine 
operator. Relatively unimportant is the dis- 
tinguishing characteristic that the computer 
operation language is processed by an inter- 
preter. This of course is not essential, and, 
in fact, the language could just as well be 
processed by a compiler. 

Turning our attention for the moment to 
some history in the development of the computer 
operation language, we began experiments in this 
area in the summer of 1957. At that time we 
were processing something over one hundred jobs 
per day on our computer with the aid of an op- 
erating system. Our only coding system at the 
time was a symbolic assembly program. Anti- 
cipating a wide use of Fortran which had just 
become available, we began the design and check- 
out of an operating system for the Fortran com- 
piler. One of our first requirements was that 
the programmer have the ability to compile a 
program in Fortran and then immediately execute 
it all on one machine pass. We also wanted him 
to be able to easily control these options. As 
a result we made our first attempts at a com- 
puter operation language. The Fortran operat- 
ing system had an interpreter built into it 
which recognized such simple terms as compile, 
execute, and others. As more features became 
available in Fortran, their terms were added 
to the interpreter's dictionary or vocabulary. 
Fewer than twenty- five verbs and nouns comprise 
the vocabulary today. However, there is un- 
limited expansion room. 
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Environment of the Conputer Operation Language 

Recognition of the computer operation lan- 
guage is significantly important in the evolu- 
tion of computer languages. The user has a 
powerful command of the computer. In our ex- 
perience the language has been enthusiastically 
accepted by the programmer. First, because it 
is easy to use and second, because it gives him 
consistent results in the processing of his pro- 
blem on the computer. From the standpoint of 
the computer operating system this language is 
an input, informing the system of the actions 
to be taken for each Job. A few words about 
operating systems is in order at this point. 
Charged with the major responsibility of op- 
erating the computer continuously and smoothly, 
the operating system shows up most favorably 
when there are a large number of jobs to process 
in a day. The author recognizes that operating 
systems were originally justified on the basis 
of such a machine load. It takes effort in 
terms of man-hours to devise these systems, to 
expand them and to maintain them. On the other 
hand, our payoff was great enough in terms of 
savings both on the part of the user and also 
in more efficient machine operation to justify 
the expenditure. Some users of present-day 
computers may find they are unwarranted in 
developing an operating system on the basis 
that they are running a relatively few problems. 
It should be pointed out that long-running pro- 
blems on today's computers will be short-running 
on the next generation of computers. But more 
important are the benefits accorded the user 
through other functions of the operating system, 
e.g., more powerful and consistent control of 
the computer via the computer operation lan- 
guage. 

Developments in Planning 

I should like to explore what might be the 
future of the computer operation language. At 
the time of this writing we had just completed 
the incorporation of a new coding system in our 
operating system. The Surge compiler is a pro- 
blem oriented language processor which is de- 
signed to aid in the preparation of data pro- 
cessing programs. It will sort a file of in- 
formation, update it, and prepare reports from 
a file. In addition to adding it to our master 
tape, we have also increased the vocabulary of 
our computer operation language. The computer 
now understands a call for Surge along with 
execution of the resulting program if so de- 
sired. Many of the options available under 
Fortran processing are also available under 
Surge processing, and accordingly are under- 
stood by the computer. The effort of incor- 
porating Surge amounted to approximately one 
man-month - an insignificant part of which was 
the expansion of the computer operation language 
vocabulary. It was not surprising to learn 
again, as we had with Fortran, that the Surge 


coding system was considerably enhanced just 
by virtue of the fact that it was more readily 
usable. The user prepares the Surge statements 
as well as the processing statements which are 
all incorporated in one dqck of cards and sent 
to the computer room for complete processing. 

It would be well at this time to bring out 
another important advantage of the language in 
that it has the inherent ability to merge parts 
of a program which have been coded in different 
languages. As an example, a program which had 
been partly coded in symbolic language, partly 
in Fortran, and partly in Surge could be quite 
conveniently combined, loaded into memory, and 
executed as a single job. The merging can be 
done at any level at which the languages are 
in fact compatible, such as at the symbolic 
language level. This advantage of the language 
should not be underestimated. 

In the near future we intend to add still 
another coding system to the package. The 
Dyana coding system, which was reported at an 
earlier Joint Computer Conference, will be 
added with the necessary extensions to the 
vocabulary. In this case the words: DYANA, 
EXECUTE, will trigger the following actions 
on the part of the computer. First the DYANA 
processor will be retrieved from the system 
master tape, and it will commence processing 
the DYANA source statements and as a result 
will produce Fortran statements. Given the 
Fortran statements, the system then understands 
that it is to bring in the Fortran processor? 
and as a result of this operation, the object 
program will be produced. In response to the 
word EXECUTE then, the system will load the re- 
sulting object program into memoiy and transfer 
control to it. The DYANA object program will 
get its input from the same source which con- 
tained the DYANA source statements, and it 
records its outputs on the appropriate media. 

At the conclusion of execution, the DYANA ob- 
ject program releases control to the system to 
bring in the next job. Thus through the com- 
puter operation language the programmer has 
controlled a sequence of processes which do not 
require any human intervention. 

Potential of the Language 

Given a large capacity memoiy device with 
reasonably good access, it will also be pos- 
sible to store a number of production programs 
in the computer. Through the use of the com- 
puter operation language, the programmer may 
then call any one of these production programs, 
supply it with appropriate data, and produce the 
necessary results. 

Future developments of the computer op- 
eration language may well include some condi- 
tional statements, boolean expressions, de- 
clarative statements, and others which are 
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presently found in problem oriented languages. 

The macro concept can be used to good advantage 
in specifying a sequence of processes. In one 
of my earlier examples, I mentioned a call DYANA. 
This was interpreted by the computer to mean call 
the DYANA processor and when it has completed 
its compilation, call the Fortran processor. In 
a very elementary sense then, the command DYANA 
is a macro. 

Figure 1 shows the potential power which 
might be built into the computer operation lan- 
guage. The first statement is a simple call for 
a processor Dyana. Next the programmer has 
specified the action to be taken if a machine 
error occurred on the first try at a Dyana 
processing. He wants to go back and recall 
Dyana. The second statement calls a report 
generator. Here again, if a machine error 
occurs on the first try he wants to go back 
and recall the report generator. Assuming 
these two processes are successfully completed, 
he then wishes to set a debugging mode followed 
by an execution of the object programs result- 
ing from the Dyana and report generator pro- 
cessors. During the execution of this program, 
if something causes an interrupt, he wants 
tapes A and B to be saved. If on the other 
hand an error occurs, then he would like to 
get a dump of all core variables and also 
tape A. Following the completion of all pro- 
cessing he wants to exit to the next job. At 
the bottom of the illustration is a macro which 
will perform the same functions as all of the 
statements 1 through 5>. In this case a simple 
CALL ABLE. 

One of the points which becomes quite clear 
in this discussion is the necessity for process- 
ors such as SAP, FORTRAN, SURGE, etc., to behave 
as subroutines to the operating system. They 
must be subroutines in the sense that they are 
informed by the operating system of the machine 
configuration available to them for each exe- 
cution. As an example a processor must be told 
where it will find its input(s), where to re- 
cord its outputs, what primary and secondary 
storages are assigned to it and even where the 
processor itself is stored. This information 
may be thought of as parameters in a calling 
sequence at the time the operating system calls 
the processor. The operating system is of 
course aware of the needs of a given processor 
and proceeds to call that processor if the re- 
quired portion of the machine configuration is 
available. Alternate sets of needs should be 
allowed which will allow the processor to op- 
erate more or less efficiently on the computer. 

The major reason for subroutinizing pro- 
cessors is solely to allow compatible operation 
of more than one processor on the computer. Two 
or mo‘re processors which make a priori assump- 
tions concerning input, output or other storage 
use probably preclude compatible operation. A 
simple case is that of two processors both of 


which assume they theyselves occupy the first 
position of a given tape. The programmer 
effort required to eliminate these incompat- 
ibilities is not inconsiderable. It should be 
noted that some large processors have already 
been written as subroutines such as the SHARE 
Operating System for the 709 computer. A second 
reason for subroutinizing processors is to allow 
a program to be written in more than one lan- 
guage, as stated earlier. The saving of each 
segment of the program as processed is another 
function of the operating system as directed 
by computer operation language statements. In 
summarizing this point processors must be writ- 
ten as subroutines before the full potential 
of the computer operation language can be readily 
realized. 

Conclusion 

Although the computer operation language 
has been used in many of today's computer 
systems, it has never been extracted and re- 
cognized as being a powerful concept. Once 
it has been recognized, it will be much more 
fully exploited. The SHARE organization's 
General Standards Committee is currently con- 
sidering the impact of the computer operation 
language on future computer systems. The lan- 
guage is a key to the sound and successful 
operation of the computer from remote stations. 

It will one day take its place with the machine 
oriented language and the problem oriented lan- 
guage. 
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A Program Written in a Computer Operation Language 


Statement Number 


Statement 


1 CALL DYANA 

IF (ERROR FIRST TRY), GO TO 1 

2 CALL REPORT GEN 

IF (ERROR FIRST TRY), GO TO 2 

SET DEBUG MODE 
EXECUTE 

IF INTERRUPT, GO TO 3 
IF ERROR, GO TO 1* 

GO TO 5 

3 SAVE TAPES A, B 

GO TO 5 

li DUMP CORE VARIABLES 

TAPE A 

$ EXIT TO NEXT JOB 


MACRO FOR STATEMENTS 1-5 
CALL ABLE 


Figure 1 
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A New Approach to the Programming Problem 


William Orchard-Hays 
C-E-I-R, Inc. 
Arlington, Virginia. 


The computing and data processing field as we 
know it today, that is, employing high-speed EDP 
equipment, although but a few years old, has 
grown accustomed to major advances, changes and 
reworkings of systems in quick succession. How- 
ever, the increasing complexity of programming 
and of applications, together with the rapid 
appearance of several powerful hardware systems, 
has now reached such proportions as to cause 
considerable uneasiness and confusion. It ap- 
pears sometimes that the effective lead time for 
preparing for a new machine has not only gone to 
zero but become negative. As to problems, it 
has become a cliche’ that they are always wanted 
"yesterday" . 

It would be hard to deny that the programming 
profession has brought some of this on itself 
although the total problem is much broader than 
what is usually considered programming. It ap- 
pears sometimes that the equipment users have 
not kept pace with the equipment makers in 
imagination, sensible distribution of effort and 
just plain pride of workmanship. Yet, a real- 
istic evaluation of the work done on large 
machines in the past three or four years will 
elicit admiration from anyone capable of under- 
standing it. But mingled with the understanding 
admiration is likely to be a trace of fear that 
man cannot yet effectively utilize the powerful 
machines which he is building. Some of the most 
ambitious, thoroughly planned and well funded 
projects have been little short of fiascoes. 

And machines five times as powerful are on the 
threshold, with other machines twenty times as 
powerful as that following. 

A brief look backward is sometimes helpful. In 
earlier days of machine computing, say as far 
back as 1952-3, there was a keen interest in 
getting jobs done and a lively anticipation of 
doing these jobs better on the equipment soon to 
be available. The dark ages of the 602 and 6oU 
had given way to the transitional CPC with oc- 
casional excursions on the EDVAC, SWAC or UNIVAC. 
People were steeped in the care and precision of 
EAM procedures, still believed in the doctrine 
of economy of space and time, and still accepted 
a challenge to do some operation in a better way. 


Machines were looked on as tools , and it was 
assumed that people would be required to make 
effective use of them — people who understood 
them. It was also accepted that computing de- 
partments existed to serve some higher purpose 
and that it was the function of the programmer 
or machine specialist to provide means to make 
machines useful to those with problems. 

It would be hard to say that these notions have 
ever been deliberately scrapped but, somehow, 
they no longer seem to describe current atti- 
tudes. Perhaps it is just that the problems 
have gotten harder faster than we have gotten 
smarter. We can hardly be accused of idleness. 
Programs have proliferated in ever increasing 
sizes until today one person can scarcely be 
aware of all the routines available for one 
computer, let alone understand their use. At 
the same time, the number of new jobs presented 
which require original programming has grown at 
least as fast. This in itself, of course, is 
encouragingj we do not wish to work ourselves 
out of a job. But neither, I think, do we wish 
to attempt to re-define every field of tech- 
nology. I am aware that much lip service, and 
much real effort, is devoted to providing better 
programming aids and "automatic" systems, pre- 
sumably for those who have a need to exploit 
machines but who have neither time nor inclina- 
tion to become professional programmers. Few 
would claim an outstanding success here. Per- 
haps the trouble is that we are attempting to 
bridge an ever-widening gap with the same double 
step that was effective in the days of the CPC, 
namely the general programmer and the system 
programmer . 

The standard solution to the problem of "getting 
on the air" with a big project has been to pro- 
liferate ever larger programming groups - by 
hiring, training, renting, pirating or any other 
means including dumping a bunch of people in to 
sink or swim. This method is becoming increas- 
ingly ineffective. A large number of people 
doing a mediocre job only adds to management’s 
problems. This is not to say that the people 
are mediocre. Rather they often lack direction. 
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However, that there is a definite limit to the 
number of potentially top-flight professional 
programmers - dedicated, intensely interested, 
highly skilled and capable of creative work - is 
apparent to anyone in the industry. This limit 
is indeed small, I would guess in the hundreds. 

This problem has not gone unnoticed, of course. 
Besides a number of elaborate processors and 
operating systems, there has been talk of a 
universal problem- oriented language, of a uni- 
versal machine -oriented language, and even of 
some kind of a universal language for both pur- 
poses. People evidently take these seriously 
since there are many active committees in exis- 
tence. But just what meaning do such phrases 
have? 

Let us look at the problem-oriented language 
first. Before we can begin, we must assume that 
it is somehow possible to foresee every kind of 
term that will ever be used and every kind of 
action that will ever be taken in defining and 
solving a problem. We can, you may say, assume 
synonyms and isomorphisms so that it is not 
necessary to pre-define every actual combination 
of characters and symbols. But this assumption 
implies a recognition and/or memory capability 
which far exceeds anything we now know outside 
of the human brain. Moreover, there is the 
deeper implication that all classes of quanti- 
ties and operations are now known. To say that 
the operation of defining an operation will be 
permitted is only evading the issue since this 
implies that we know how many innovated opera- 
tions will interact with all existing operations. 
Given any defined language, it is not very diffi- 
cult to show that there 1 are reasonable problems 
which may require instructions, i.e. notions, 
outside the language. Hence no problem language 
can be universal. 

Similar arguments can be made regarding a 
machine -oriented language. Consider a simple 
example. One of the tasks to which such a lan- 
guage must be adequate is the translation of any 
program to a form usable on a machine and, at 
the same time, responsive to the needs of the 
problem formulator. Suppose this person is 
studying improved methods of handling his prob- 
lems and wishes to determine how much informa- 
tion is required in translating a program. Then 
he must be able to write a -program which outputs 
information about every action of the processor. 
Hence the processor must accept modifications to 
and interruptions of itself. But then it is no 
longer merely the processor of the original 
language. Hence he must be using a more ex- 
tensive language. 


There are other and more powerful arguments 
against the feasibility of a universal machine- 
oriented language. This is betrayed by the fact 
that such supposed languages have a bad habit of 
turning into rather specific problem-oriented 
languages where the problem is how to describe 
a machine, whereas what is desired is the 
ability to use a machine. Yet the notion of a 
universal machine-oriented language system does 
have meaning, it seems to me, in a somewhat 
different sense. 

In order to develop this argument, it is neces- 
sary to consider a few basic principles. I will 
restrict myself to four, all of which have to 
do with the dynamic nature of computing. 

1. The introduction of new ideas and new 
applications is an inherent part of 
EDP technology, both in the manufacture 
and in the use of equipment. Some of 
these new ideas and applications have 
unanticipated but far-reaching logical 
implications. 

It seems to me that a failure to recognize this 
principle has caused us to get farther and 
farther behind. We are forever coming up with 
excellent solutions to problems which we 
muddled through a year previously, while at the 
same time trying to cope with current problems 
which have new requirements. Furthermore, we 
should certainly be aware by now that every 
major procedure, when perfected, is looked on 
by the analyst as an available technique or 
operation and that he will seek new degrees of 
complexity in his next project. No fruitful 
line of development is ever complete and very 
few computer program systems will remain static 
if they are found useful. This is verified by 
many years of consistent experience in nearly 
all installations. Compatibility is a greatly 
overrated virtue. It is desired most urgently 
when there is a need to reprogram, say for a 
new machine, _a_ horrendous complex of codes 
which we have lost control of and no longer 
understand. We don't really want the compat- 
ibility, which stifles advancing the art, but 
rather we fear to expose the muddle into which 
our own work has gotten. 

2. Since the handling of information 
involves the generation and manipulation 
of information about information, the 
growth of data handling requirements „ 

is self -compounding unless carefully 
constrained. 
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This principle is insidious, but true. It might 
be called the Parkinson’s Law of data processing. 
It is interesting to note that human beings, in 
doing clerical work or calculations, impose the 
necessary constraints in the form of what we 
call common sense and judgement. The dullest 
sort of person is able to exhibit these qual- 
ities in far greater measure than we often admit, 
but the brightest genius has difficulty in pre- 
cisely defining them. Which leads to the third 
principle . 

3. Nothing can be mechanized which cannot 
be precisely defined. This is as true 
of second, third and higher orders of 
control as of primary mechanization. 

I have come to the conclusion that many people 
who ought to know better don’t really believe 
this principle. Sometimes I try to ignore it 
myself, always to my eventual grief. It is too 
tempting to solve easy problems, pushing the 
hard ones back with a sort of mental note that 
"we'll do something" when the time comes. If 
pressed we may mumble something about statis- 
tical methods, which sounds impressive. Now 
I would not want to appear to minimize the worth 
of faith, self-confidence, and drive. But it is 
foolish to undertake a piece of work without 
having first estimated the effort required. In 
data processing, we have sometimes been trapped 
into doing this because of a failure to under- 
stand the true complexity of what we are asking 
for. Thus complex, far-reaching and important 
decisions and plans are left to those who are 
unqualified to make them, technically, admin- 
istratively or policywise. 

The inability to precisely define all aspects 
of a technology does not bother us too greatly 
in most endeavors. A car still requires a 
driver, a plane a pilot. We think of an assem- 
bly line as highly mechanized but it works only 
because many people are working on it and making 
little spot judgements all the time, sometimes 
big ones. But who can work at the rate of 
hO ,000 additions per second, even in a super- 
visory capacity? Of course, the very essence 
of the stored-program concept is to allow com- 
plete pre-planning, which amounts to canned 
supervision, of the whole sequence of operations 
required by some process. There are two catches 
to this. The first is our second principle, 
i.e. the bookkeeping soon outweighs the data 
processing operations and becomes a bigger data 
processing problem than the original one. The 
second catch is that it is impossible to pre- 
plan the proper action for every eventuality? 


in fact, it is questionable whether the term 
"proper" has any meaning in this context except 
as dictated by particular policy. All this 
is simply a roundabout way of saying that 
human beings cannot be taken out of the daily 
activities for which they are responsible and 
which exist in the first place by their decision. 
The introduction of new machinery - no matter 
how revolutionary, powerful or potentially 
useful - does not change this basic fact of life. 
Rather than relieving people of effort or re- 
quiring less in the way of disciplined endeav- 
ors, this new equipment is more demanding if 
its use is to be profitable. 

Let us admit boldly, or humbly if you prefer, 
that there is a lot of confusion in data 
processing work today. Let us further admit 
that there is a great deal of difficulty and 
uncertainty in the projects to which DP equip- 
ment is often applied. If there weren't, most 
of us would be out of work. Now while it may 
be possible to cut down on the confusion, it 
appears realistic to me to assume that the 
problem difficulties will increase, certainly 
not decrease. These problems are not the 
exclusive domain of any one discipline. Cer- 
tainly they do not all belong to the field of 
programming. Programming has its share of 
tough problems and also its share of skill and 
helpful techniques to be contributed. What 
makes the programmer invaluable is that he 
knows how to make a computer perform. What 
makes him inadequate is that he doesn't always 
have the script. After all, problems originate 
in people's minds and techniques of operation 
reside in other people's minds. We need to 
remember, and to believe, that the really 
valuable residue of a completed job consists 
of the ideas, insights and techniques recorded 
in human brains, in short the experience 
gained. This leads to the fourth principle. 

1+. A wide variety of skills are involved 
in present-day EDP applications thus 
multiplying the problems of commu- 
nication. 

Communication evidently means different things 
to different people — which seems odd. But 
one must recognize that vast efforts are under- 
way to increase communication between systems, 
sub-systems, or just plain code and systems. 

It is not clear to me that communication 
between two inanimate objects can exist but I 
admit my knowledge of psychology is limited. 

At any rate, I am more concerned with commu- 
nication between humans and by humans with 
machines . 
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Now there are at least three important condi- 
tions for communication: 

(1) The parties concerned must have rapport 
in the area under discussion; 

(2) the quantity and rate of information 
sent at one time must be within the 
capacity of the receiver to assimilate; 

(3) the language and medium used must be 
known and compatible to both parties. 

A lack of any of these can be compensated for by 
sufficient determination, repetition, patience 
and time. In EDP, however, efficiency must be 
given great weight. We cannot always afford 
to compensate for poor communication. 

It might be asked how one can gain rapport with 
a machine. The best way is to build into the 
programs the kind of receptors and output 
devices that one would like. But this option 
is usually available only to a limited number of 
people, namely, expert system programmers. But 
once the user is one step removed from the 
machine, as say a production programmer, the 
communication is mainly between people . Further- 
more, the system programmer now has "a new respon- 
sibility, viz., to provide techniques which can 
be responsive to the user without major overhauls 
in handwritten code. Repeat, responsive. 

Turning to the other end of the spectrum, how 
can the language and working papers of the 
physicist, economist, engineer, or operations 
researcher be compatible with the language and 
experience of the professional programmer. 

Except in isolated cases of fortuitous back- 
grounds, the answer is they can't. The truth 
is that there is a whole spectrum of languages 
from the problem formulator to the machine 
operator, with only slight intersections. I 
dislike to call these language "levels". It is 
not a question, at least not anymore, of one 
being above or below another. The language of 
the engineer is every bit as specialized as that 
of the programmer. Generalizations can be made 
in both directions but this may or may not help 
to get a job "on the air". 

Now the machines are not the exclusive domain 
of any one group, any more than the problems, 
except in an administrative sense. The problem 
formulator is just as anxious, and probably 
more needful, to exercise control and introduce 
changes from his viewpoint and with his lan- 
guage as is the system programmer. I think we 


all recognize the fact that human beings still 
must work out together the roles they will each 
play in getting a job done. That is, that the 
preliminary discussions, planning, mutual 
training, etc. are going to be with us for a 
long, long time. What we apparently have 
failed to recognize is that the machine itself 
can be used in such a way that it can help us 
remember, control, modify and use the fruits 
of all this labor much more efficiently. 

What should the system programmer be’ doing? He 
should not be building systems for himself. He 
should be providing standard parts and routines 
with which systems can be built. Of course, 
his consultation will be needed but the whole 
system should be literally built by the whole 
spectrum of people who are going to rely on it. 
How can the programmer anticipate the kind of 
system and languages that the next project will 
require? He must be prepared to build any 
definable system or at least to supervise its 
building. That is his specialty. The same 
thing is true of language. What we need is not 
a universal language but the ability to define 
and utilize any language that suits our purposes. 

Consider a moment. Systems of 10,000 - 15,000 
instructions are routine, 23,000 - 50,000 
instructions are not uncommon. Some plans now 
underway will likely require 50,000 - 75,000 
instructions. Where do we stop? Some of these 
systems are supposed to be all things to all 
people but on closer examination it turns out 
that they generally do about one thing well. 
Maybe they compile algebraic statements, or 
maybe they assemble and execute handwritten 
programs or maybe they perform certain standard 
D.P. functions. But they are only straight- 
line cuts out into a whole field to be harvested. 
Furthermore, they are difficult to tinder stand 
and require a staff of specialists to service 
each one. Far from facilitating communication, 
they vastly complicate it. 

Suppose then we begin by trying to extract from 
all systems those parts which are common. We 
will, without question, have to establish 
certain rules and conventions, for example, 
the card form for instructions to a monitor 
(any monitor) and the first and last words of 
each tape record, etc. In other words, we have 
to establish standards. But these standards 
must not be restrictive to the user. Their 
only purpose is to allow standardized parts 
from which any system can be started. We must 
not forget that knowledgeable people must be 
integrated with systems TT versatility is to be 
maintained. 
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In addition to a package of standard parts, 
there is something more needed. This is a 
standard philosophy of using them. There is a 
somewhat subtle distinction here. We do not 
want to make all systems be just alike, other- 
wise we may as well program a standard system. 
The idea is to have standard techniques to 
cause a system to grow in whatever way is needed 
at the time. The word "grow" is not used light- 
ly. Certain characteristics of an organism must 
be provided. To me, the notion of a universal 
computer-oriented language has meaning only in 
this dynamic sense. 

There is one further requirement. The method 
must not lead to unnecessarily cumbersome 
systems, that is, this must not be inherent. 
With the present state of the art, at least, 
this will require some rather stringent convent- 
ions regarding symbology. Hopefully, they can 
be relaxed as experience is gained. I do not 
see this as a great problem, however. People 
are pretty good at picking up rules of this 
kind and at getting used to cryptic notations. 
They may grumble but their real problems usually 
lie elsewhere. If they understand the structure 
of the system and its attendant language, the 
actual encoding is not too important. 

What are some of the features that might be 
expected to be available with such a method? I 
think the following is a minimum list. 

(1) A core of highly polished utility sub- 
routines, a standard skeleton monitor, 
and a basic assembler, all capable of 
expansion in well-defined ways and 
maintained by a single small group 
(per installation) of highly skilled 
professional programmers. 

(2) The capability of defining hierarchial 
sets of macro-operations tailored to the 
job and understood to the depth required 
by the various echelons or categories of 
professional people involved. Since 
they would be involved in defining them, 
they should easily understand them. 

(3) The ability to make changes at any level 
provided that at least the next lower 
level were understood, and to add levels 
as need dictates. 

(10 The ability to debug in detail at object 
time without detailed pre-planning. 


(3) Complete revision of a system by 

revising definitions and restarting 
from the basic system, with a minimum 
of elapsed time. 

(6) Partially automatic documentation of a 
system since higher levels of opera- 
tions will require fewer lines of 
programming. 

(7) Ability to experiment with systems 
as such, and to extract production 
systems at will. 

(8) Alleviation of the training problem 
by spreading the load now imposed 

on programming over a wider variety of 
skills . 

Let me re-emphasize that such a technique as 
just outlined is going to be very demanding 
of the people using it. Responsibility is a 
necessary counterpart to responsiveness. The 
ability to exercise higher level control 
implies the ability to make bigger, more 
disastrous errors. With proper safeguards, 
however, this is somewhat compensated for by 
the ability to quickly reconstruct a previous 
situation. 

To sum up the foregoing very briefly, no one 
can become expert in many fields simultaneously. 
Programming must provide access to machines for 
others, via the programmer but not “by the 
programmer on every occasion. To expect to find 
individuals who can understand all the goals, 
restraints, techniques and policies of a large 
project and still be capable of translating 
these into a mechanized system is unrealistic. 

To hope that a system can be defined by one 
group and turned over to another for imple- 
menting is just as unrealistic. Everyone must 
play his proper role from the outset and 
communication must exist between members of a 
team, at the least by pairs. After all, systems 
exist to serve some human purpose, not to be 
self-sufficient automatons. The trick for 
making the hardware a useful member of a team 
is to construct systems which are self-adaptive 
,in a machine oriented sense, but externally 
responsive in a problem oriented sense. The 
know-how to accomplish this exists. 

As will be seen, there are implications of so- 
called self-organizing systems here. I believe 
that the reduced cost per computation, promised 
by the bigger, faster and more expensive 
machines around the corner, can only be achieved 
in practice by some such innovation. This is 
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not, however, a new idea, as suggested at the 
outset. It is simply the old idea of breaking 
down a technology grown complex into simpler 
pieces understandable separately by a wider 
cross-section of the population. The skills so 
highly developed and so cherished by the expert 
programmer are as needed and as important as 
ever. But his field hgLS also become so special- 
ized that he must concentrate on particular areas 
to be effective. He must use his skill to open 
up new facilities to others so that they too 
can work more effectively. Only by a multi-link 
chain of people between problem and machine can 
the growth of EDP be kept on a workable basis 
such as gave it its initial impetus. 
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Summary 

A machine is described which recognizes line 
drawings of circles, triangles, squares, pentagon^ 
and hexagons. This identification is independent 
of rotation and, within limits, of size, precision 
of drawing, or positioning. The system will also 
distinguish and count separate objects up to six 
with limited independence of the size, shape and 
position of each object . 

This "Gestalt" recognition is accomplished 
by using a dilating circular scan. The virtue of 
such scanning is that object-size changes trans- 
late into time-of-arrival changes while object 
rotation preserves topological relationships. 

The n-gon detection depends on side counting, and 
the object counter uses the same logic in conjunc- 
tion with a detection of discontinuities. 

Thirty-two small photocells arrayed in a 
circle are mechanically puckered across the input 
plane . Light signals derived from opaque line- 
drawings are converted into binary representation 
in a special thyratron register. Time sequencing 
of logical operations and output displays is relay 
controlled. The actual decision logic uses a 
novel combination of optical and electronic de- 
vices in a simple but effective manner. 

It is also shown that this circular scan 
technique is applicable to automatic detection of 
letters and numbers over a wide variety of styles, 
and several examples are given. 

Introduction 

One of the very interesting classes of 
problems in pattern recognition is that of the 
detection and classification of two-dimensional 
visual images such as written language and line 
drawings. Research of this nature is interesting 
because it may elucidate some of the problems of 
human and machine pattern recognition; it may be 
useful as well since it can lead to devices such 
as automatic reading machines. This paper dis- 
cusses some of the problems encountered in such 
visual pattern recognition and describes a new 
approach to machine recognition of printed and 
handwritten letters and numbers. 

In the past, machine detection of two- 
dimensional visual patterns has been generally 
restricted to carefully constructed and positioned 
alpha-numeric characters . Ideally however, 
pattern recognition should be independent of 
character size, rotation, and within limits, 
position, style, and noise . Such general or 
"Gestalt" recognition takes advantage of a set of 
characteristics defined to be invariant under 


these transformations, and can be usefully 
applied to many detection problems . 

If we want a machine to make identifications 
which agree with those a human observer might 
make, the rules for classification of "essential 
features" must be similar for each. The defini- 
tions of letters, numbers, triangles, or any 
other symbols are quite arbitrary and are made in 
terms of the structure and limitations of the 
observer’s mechanisms. Hence it is necessary to 
be quite careful when establishing the "essential 
feature" requirements for any classification. 

These essential features may be quite diffuse 
and exist only within ill-defined limits. 

Recognition of visual patterns is most in- 
teresting and valuable when it is not restricted 
to a small class of inputs. We should like to 
understand how an A or a triangle may be detected 
irrespective of position, rotation, dilation, or 
of variations in style . A really useful reading 
machine will be one which can recognize the 
symbol 5 whether it is continuous or in two parts 
and whether it has been printed by a machine or 
scrawled by a child. 

There are in existence a number of systems 
and proposals for obtaining limited recognition. 
IBM*-*-' has developed computer programs for reading 
a variety of alpha-numeric characters. Intelli- 
gent Machines Research'^' and Solartrom 3 ) have 
machines for sale which read numbers and letters 
printed in one carefully controlled type font . 
Stanford Research Institute \ ^ ' and others have 
produced magnetic ink check readers . 

Most of these systems have similar design 
concepts, based on inspection of sequences of 
pulses generated by scanning with a slit aperture 
moving past a symbol. Segments of numbers and 
letters are detected by matching pulse sequences 
for best fits to stored matrices of segments. 

The number, size, and position of these elements 
determines recognition of a character. 

These devices have limited utility because 
of their critical dependence on the style of type 
which can be reliably read and their sensitivity 
to format and to tilt and size changes. They 
generally come into being to serve one specific 
purpose, that of high speed reliable recognition 
of one fixed symbol style. The IBM system is 
built around their output printer type font, the 
IMR scheme for credit card reading uses a letter 
and number style especially created for this 
purpose, and the SRI magnetic-ink marks must be 
carefully held to close size and position toler- 
ances . 
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The principles that these systems use to 
achieve recognition are not very general or 
easily extendable. The desire for automatic 
reading of income tax forms, train car markings, 
postal addresses, bank checks, toll tickets and 
the like will have to be met by machines using 
much more powerful algorithms than those present- 
ly available. Considerable effort is currently 
being put into finding more general and flexible 
systems for pattern recognition, (5-10) but there 
are many difficult problems yet to be solved 
adequately. 

One new approach which attempts to get at 
some of the problems of achieving more general 
recognition will now be described. Following 
that is the description and circuit details of a 
simple pattern-recognition machine based on that 
system. 

Principles of Operation 

It is possible to inspect a simple line 
drawing in a way which yields similar transfor- 
mations for geometrically similar figures, in- 
dependent of their absolute size or rotation. 

This "Gestalt" recognition is accomplished by 
using a dilating circular scan. The virtue of 
such scanning is that object-size changes trans- 
late into time -of -arrival changes while object 
rotation preserves topological relationships. 

Consider an array of picture elements 
arranged in c concentric rings, having r elements 
in each ring. If each ring is inspected sequen- 
tially, say from center outward, the resultant c 
sequences of r signals can be shown to have the 
desired transformation properties. A logically 
equivalent implicit scan is possible with uni- 
directional propagation along radii and with 
unit delays inserted between adjacent elements 
on each radius. This type of signal processing 
has interesting analogies to physiological 
structures . 

Suppose that an equilateral triangle is 
centered on such an array. The first inter- 
sections of the scanning interrogation ring with 
the figure will be three simultaneous "hits" on 
the mid-points of the triangle’s sides. As the 
scan continues its outward sweep, points on each 
side of the initial intersections are progress- 
ively touched. The last parts of the triangle 
which are inspected are the vertices, those 
points at the greatest radial distances from the 
center . 

Thus the outward puckering scan produces 
three radially equidistant epicenters of activity. 
An accumulating register of r places, represent- 
ing the r elements of each ring in turn, will 
display three single active cells initially, 
equally spaced along the length of the register. 
There will then be a spread of activated cells 
in each direction from each of the initial loca- 
tions. Finally the apices of the equilateral 
triangle will cause the last three cells of the 
register to be activated. Again these are equally 


spaced, representing the 120° radial symmetry of 
the vertices. 

A similar but smaller triangle would have 
generated similar signals earlier in the sweep 
cycle; conversely for a larger presentation. 
Rotational changes are represented as lateral 
shifts in the contents of the register. Thus the 
detection of three simultaneous, equidistant, and 
uniformly expanding waves of activity is suffi- 
cient to "recognize" an equilateral triangle 
independent of size or rotation. (Bounds on size 
exist, of course. The lower limit is fixed by 
the resolution of the array of elements while 
the upper bound is determined by the field seen 
by the array.) 

An isosceles triangle will generate simply 
detectable time and position asymmetries . 

Squares and rectangles are differentiable in a 
similar manner; their unique common property in 
this system is, of course, a count of four. A 
convex polygon of any number of sides can be 
detected, given sufficient resolution; circles 
are represented by a simultaneous filling of all 
places in the inspection register. 

Let us consider now how a system can be 
developed to make use of such scanning . Suppose 
for example that the input transducer consists 
of 32 concentric rings of 64 photocells each, 
as in the illustration below. 



Let a sweep cycle comprise the simultaneous 
gating of all 64 cells in each ring, one ring 
at a time, into 64 corresponding amplifier 
channels . Thus for an outward puckering scan, 
the innermost ring is first interrogated, then 
the next, etc., until the gating out of the con- 
tents of the outermost ring completes a sweep 
cycle. Fig. 2 shows, in block diagram, one 
system for using this information. 

The! beginning of each sweep initiates a 
storage operation in each of the 64 function 
generator locations . A linear accumulation 
(e.g., current) continues in each location until 
stopped by the first black element encountered 
by the expanding sweep. The value of each func- 
tion at the end of sweep may be read out as a 
voltage level which is proportional to the radial 
distance of the corresponding image point from 
the center of the array. 
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At the end of a sweep, a comparator makes 
peak detections on the 6k generated functions . 
This operation is performed "by local differencing 
over several adjacent locations. A simple count 
of such peaks is sufficient to identify an n-gon, 
while a measurement of the inter-peak distances 
(given by the addresses of the radii on which the 
peaks occur) establishes criteria for separation 
of isosceles and equilateral or of square and 
rectangle . 

It can be shown that the alphabet of line 
drawings recognizable by this system is not re- 
stricted to a few polygons. Many figures have 
unique "invariants" if they are interrogated 
with an expanding circular scan whose origin is 
near the center of gravity of the figure . 

A modest number of more complex comparator 
operations produces much more sophisticated 
recognition. Detection of concave and convex 
segments distributed in time over the 6k channels 
leads to identification of numerals, for example. 
A series of gedanken experiments on many examples 
of hand-printed digits has shown that very modest 
extensions of logic considerably enlarge the 
classes of recognizable images. Fig. 3 shows 
several examples of numerals which have been 
"recognized" in these experiments. 

Each of the four numerals was drawn free- 
hand and roughly centered on a layout of the 
photo-receptors. The rectangular grids are ob- 
tained by unfolding the circular array after 
making a cut from the center to the top of that 
array. Thus points along the top of the rectangle 
represent points on the inner circle of the 
photocell matrix while the bottom line contains 
the points forming the periphery of the circular 
array. The left and right vertical edges of the 
rectangle were coincident in the original layout, 
being the two edges of the cut we made . 

This simple transformation makes it con- 
siderably easier to follow the scanning events 
and to discover useful detection criteria. The 
expanding sweep circle is seen to be represented 
by a horizontal line 6k cells wide moving down in 
time through 32 positions. Any given horizontal 
line may be thought of as the contents of a 6k 
place register at any epoch. Each activated 
receptor location on the circular array is shown 
as a dot in the rectangular representation. 

It can be readily seen that size changes 
would appear as vertical (time) shifts while 
rotational transformations cause lateral dis- 
placements of the pattern. Since the left and 
right edges of the rectangle are actually con- 
tiguous in the original array, lateral displace- 
ments would simply wrap around the rectangular 
grid, disappearing off one vertical edge and 
reappearing at the other. 

A set of tests for the examples shown in 
Fig. 3 will now be given. These tests are for 
purposes of illustrating principles and should 
not be considered the most complete or effective 


rules which can be found. We shall invoke two 
simplifications; the numerals will be centered 
and there will be no rotation. 

An ascending function will mean a mostly 
continuous mostly monotonic decrease in values 
of c as r increases. The first 2 shown would be 
said to have an ascending function from r = 0 to 
r = 6; conversely the first 5 has two descending 
functions between r = 0 and r = 6 . In a similar 
manner we may describe horizontal segments, con- 
cave or convex functions, etc . We test for the 
following : 

1. An ascending function for 0 < r < 8. 
This describes the curved section of the 
upper right portion of most 2*s. 

2 . An ascending function for 2k < r < 32 • 
This describes that portion of the tail of 
a 2 lying to the right of center. 

3. A partially horizontal or ascending 
function for 32 < r < ^0. The presence of 
at least a short segment which is level or 
rising during this interval indicates the 
left end of the tail or the bottom of the 
down- stroke for most 2’s and is independent 
of whether or not a loop is present. 

4. A two-valued function for 0 < r < 8 and 
60 < r < 6k establishes the presence of the 
top bar and top of the curved portion of 

a 5. 

5* A descending function for l6 < r < 40 
describes the tail of a typical 5* 

The limits on r need not be taken as 
absolute. If our tests are based on relative r 
values, then the results of the tests can be in- 
dependent of rotation of the sample numerals. 

Rules based on the five tests listed above 
are sufficient to obtain unique "recognition" of 
the samples given and of a variety of other ex- 
amples of these two numerals. The first three 
tests concern properties of the number 2 which 
are relatively invariant over a population of 
about twenty samples which have been checked. 
Similar rules have been successfully applied to 
all of the ten digits for a number of hand- 
printed samples . The extension to letters of 
the alphabet while more complicated, does not, 
in principle, appear difficult . 

A Model to Demonstrate Principles 

A simplified model has been built to 
recognize line drawings of circles, triangles, 
squares, pentagons, and hexagons . This identi- 
fication is independent of rotation and, within 
limits, of size, precision of drawing, or posi- 
tioning. The system will also distinguish and 
count separate objects up to six with limited 
independence of the size, shape and position 
of each object. 
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The present device simply detects and counts 
the segments intersected during one expanding 
sweep cycle. It is the largest count which de- 
termines the correct number for identification. 

In general, this maximum count may occur only 
briefly during part of a cycle, or it may be 
generated by several different segments encount- 
ered sequentially during the cycle . For any non- 
concentric placement of the figure with respect 
to the sweep center, one or more sides will be 
encountered earlier than the others . The rela- 
tionships are depicted for the case of a square 
in the illustrations below. 



In A, the innermost circle represents the minimum 
sweep diameter where no part of the figure has 
been encountered. The next circle out, corres- 
ponding to a time later in the sweep cycle, in- 
dicates the four segments cut from a well-centered 
figure, and the succeeding circle has just en- 
countered the vertices. Finally the sweep has 
passed out of the square; all portions of the 
inspection circle have encountered the figure . 

A slightly non-concentric placement, as in 
B, results in first one side of the square inter- 
sected, then three, perhaps four for a moment, 
then one again. This is in contrast to the case 
A where if any sides were contacted at any in- 
stant, all four were . In C where the decentering 
is excessive, there are never more than three 
sides intersected at any radius of the expanding 
sweep. It is required then to detect, store, 
and sum the number of separate sides encountered 
during one sweep cycle. This number can never be 
more than n for any position of an n-gon, and 
except for accurate centering of regular figures 
it is generally less than n at any instant. 

Let the expanding circular sweep be quan- 
tized into a sufficient number of elements, and 
suppose that those elements which have not passed 
through the figure are given the value of 0; 
those which have encountered the figure are 
called 1. The system then must count the transi- 
tions between strings of ones and zeros around 
the circle . This count thus indicates the number 
of sides in the figure. 

We can distinguish separate objects in the 
following manner. Notice that for any closed 
figure lying within the swept area and enclosing 
the center of sweep, all ones will eventually be 
generated. This is not the case for separate 


objects placed in the inspection field as in the 
illustration T^elow. 



Here there will be four discrete strings of 
ones and zeros but not all zeros will have been 
changed to ones by the end of sweep. This simple 
fact is used to provide discrimination of 
separate objects from continuous (or nearly con- 
tinuous) line drawings. 

Notice that the count does not depend on 
the size or shape of the objects, but only on 
continuity of form, i.e. on unbroken strings of 
ones . 


The block diagram for this system is shown 
in Fig. 6. In this version, 32 rather than 6k 
quantized positions are used in each ring. Also, 
successive positions of the expanding sweep pro- 
vide continuous signals rather than the 32 dis- 
crete signals described earlier in the general 
model . 

The requisite scanning and resultant signals 
which provide input for the machine is obtained 
from 32 small photocells which are mechanically 
puckered across the input plane. As successive 
portions of the line drawing are encountered, a 
thyratron register records this information. At 
the completion of a sweep cycle, this stored data 
is examined for one - zero strings, i.e. - a series 
of ones followed by a series of zeros . 

Conventional means for determining these 
binary sequences in a register involve shifting 
the contents of the register or otherwise se- 
quentially commutating the digits. The present 
system avoids this complication by a simple 
parallel operation. Note that for a count of 
the numbers of strings of ones and zeros , the 
essential information is contained in the tran- 
sitions between strings, i.e., the one - zero or 
zero - one locations in the binary number. For a 
closed ring of digits, as in the register repre- 
senting the state of the 32 photocells, the 
numbers of strings of ones and zeros are equal. 
Consequently there will be 2n transitions for n 
strings of ones . Since the number of strings 
of ones (or zeros ) corresponds to the number of 
sides of the inspected figure, a triangle is 
represented by 6 transitions, a square by 8, etc. 
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Parallel inspection of the thyratron regis- 
ter is obtained, by connecting a small neon lamp 
(NE-2) between each pair of adjacent thyratron 
plates. As photocells pass beneath portions of 
the figure being scanned, they fire their respec- 
tive thyratrons, which thus change state from 
zeros to ones . Those thyratrons yet unfired 
because their photocells have not been darkened, 
remain as zeros . At the junctions of ones and 
zeros , the neon lamps light, since there is 
sufficient potential difference between the 
plates of fired and unfired thyratrons. As can 
be seen in the illustration below, n sides pro- 
duce 2n discontinuities and thereby 2n lighted 
neons . 
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FIG. 7 

The lighted lamps are "counted" in the 
maximum-count detector circuit (see Fig. 6), 
which sums the light output of these lamps and 
produces a voltage proportional to the number of 
sides of the figure . This part of the system 
consists of an enclosure with the 32 neon lamps 
arrayed in it together with a photocell. The 
signal from this cell is amplified, peak detected, 
and stored in a capacitor. 

Since the peak detector indicates only the 
maximum number of neons lit at any instant, there 
will be fewer than the correct number of sides 
indicated in some cases. Consider Figures B and 
C of Fig. 4. Unless the object and scan centers 
are coincident, there may not be n intersections 
at any one time; rather the n sides of the object 
may be encountered successively during the sweep. 
One simple solution to this problem is to "save 
sides", i.e. to prevent the n strings of ones 
from flowing into each' other as the vertices of 
the figure are encountered. This is accomplished 
by inhibiting that thyratron controlled by the 
last photocell passing through a vertex. It is 
locked up by adjacent thyratrons (thus preserving 
the sequence . . . .101 . . . ) and consequently the 
neon lamps on either side remain lighted for the 
duration of the scan. 

In the final step a level detector classi- 
fies the stored signal and lights up the corres- 
ponding display. Since the amplitude of the sig- 
nal is proportional to the number of sides in the 


figure inspected, the level detector is set to 
report a hexagon for the highest signal, a penta- 
gon for the next lower, and so on through square, 
triangle, circle, and no-figure . 

This detector circuit comprises five thyra- 
trons, each of which has a preset firing thres- 
hold corresponding to the signal it is expected 
to classify. The stored, peak-detected signal 
is applied in common to all five stages. A 
common holdoff bias is then removed from one 
thyratron at a time, starting with the hexagon 
(highest level) detector and proceeding mono- 
tonically to the circle (lowest level) detector. 
When that stage is encountered where the signal 
is sufficient to fire the thyratron, recognition 
occurs . 

This machine can also count up to 6 small 
objects placed at the inspection station. Only 
if a closed figure is inspected will all photo- 
cells be shadowed, whereas if discrete objects 
are presented, only a few cells, those passing 
beneath the objects, will have responded by the 
end of a sweep cycle. The distinction is easily 
made by monitoring the total current of the 32 
thyratrons . 

Notice that the count does not depend on 
the size or shape of the objects, but only on 
continuity of form, i.e. on unbroken strings of 
ones . The detection scheme used for the present 
device will erroneously count as one object any 
two or more which lie on a single radius, even 
though they are separated (at different radial 
distances). A more elaborate scheme can easily 
be made to count such objects separately. 

A working model of this system, built to 
demonstrate principles, is shown in Fig. 8. A 
pentagon figure is on the horizontal platen, the 
central portion of which is the inspection sta- 
tion. The figure has been inspected, and the 
responding display appears in the small vertical 
panel behind it . Beneath the platen may be seen 
the mechanism which moves the scanning photocells 
in radial paths and which drives a rotary switch 
for controlling the sequence of events in a 
scanning cycle. At the very top of the apparatus 
is the lamp (within the pyramidal shade) which 
transilluminates the figure for scanning. 

Fig. 9 shows four objects which have been 
counted, and the answering display. 

This machine serves to illustrate the 
feasibility of using a circular scan to identify 
or count simple objects independently of rotation 
and, within reasonable limits, of dilation or 
translation. It is relatively uncomplicated, 
despite the detailed appendix which follows. 

Appendix 

In this section, some of the essential 
features of the pattern recognizer are described 
in greater detail. A complete schematic of the 
system is shown in Figure 10. 
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Scanning and Control Mechanism 

The scanning photocells are actuated by a 
motor-driven mechanism which comes into play when 
identification of a figure is requested. The same 
motor drives a rotary tap switch which programs 
the sequences and durations of voltage applica- 
tions to different parts of the circuit . A com- 
plete scanning cycle lasts about 1 second. 

The photocells are mounted on rods which 
are part of a mechanism that is very much like an 
umbrella rib assembly. The cells are attached to 
the free ends of the rods and face upward. As 
the mechanism is operated, the cells, forming a 
circle of variable radius, move along radial 
paths. In a cycle, the cell assembly is first 
brought in to a minimum radius of about 1.5 
inches, is then expanded at a linear rate of 4 
inches per second to a maximum radius of about 
5 inche s . 

The rotary tap switch has two sections of 
32 contacts each, one section being used to close 
control relays and the other section to open them. 
These relays enable and disable different sections 
of the circuit by switching appropriate voltages. 
The sliding contactor arrangement in Fig. 10 
represents the operation of these relays. 

Wot shown in this sequence chart is a relay 
which operates when the button is momentarily 
pressed to request identification. The relay 
locks itself up and turns on the overhead light 
and the driving motor. 

Regi ster 

In this section of the circuit, the one 
signals derived from the lines of the figure by 
the photocells are used to fire thyratrons, thus 
registering these events for subsequent use. 

Each of the 32 photocells triggers a thyratron 
as it passes under a line of the figure to be 
identified. 

At the start of a cycle, the 32 thyratrons 
are disabled (no plate voltage) so that the 
scanning mechanism can come up to speed. When it 
does, relay 1 applies 150 volts to the plate bus 
and the active scan cycle begins . As a photocell 
passes under a line of the figure, it generates 
a positive pip, large enough to overcome the bias 
on the thyratron control grid, causing firing. 

The 32 thyratrons form a closed ring, tied 
plate-to-plate by small neon lamps shown as L7, 

L8, L9, etc . When one thyratron is conducting, 
its plate potential falls, while that of its 
neighbors remains high, and so the two lamps on 
either side will light up. If a neighbor is 
fired, the lamp between the fired thyratrons, 
having little or no potential difference across 
it, goes out. However, another lamp goes on 
between the newly fired thyratron and its un- 
fired neighbor. As more thyratrons are fired, 
lamps are progressively turned on and off, the 


lit one at any moment locating the boundary of 
fired and unfired stages. Thus as the sweep 
expands, we continue to have a pair of lit neons 
for each line segment of the figure as is shown 
in Fig. 7* We see then that as the cycle pro- 
ceeds, two lamps will light at the first points 
of contact with each side of the figure, then 
separate and run out toward the vertices. 

If the figure to be identified should be 
displaced relative to the circle of cells so that 
one or more vertices are completely scanned before 
others, the correct number of lamps (2n) vail not 
be on simultaneously. Since the photocell in the 
lamp housing reports only what it sees at any 
instant , it will report too low a number. 

The circuit of 32 thyratrons has a "side- 
saving" feature which makes it unnecessary to 
center the figure precisely on the sweep center. 

We can save sides in the following manner: if 
the thyratron controlled by the last photocell 
to sweep through a vertex can be prevented from 
firing, the neons on either side of it will re- 
main on even after the circle of photocells has 
dilated past this corner. If this action takes 
place at all vertices, the lighted neons denoting 
sides will all remain on. The photocell in the 
lamp housing will then see 2n lit lamps corres- 
ponding to all sides scanned, regardless of the 
temporal sequence . 

This feature is realized by using the plate 
voltage swings of fired thyratrons to inhibit 
firing of adjacent stages . A resistor network 
couples each plate to the screens of both neigh- 
bors. If a tube is flanked by either one or two 
unfired stages, its screen is kept at ground by 
the associated diode which is being held in con- 
duction. This condition allows normal operation 
of the stage . However, if both neighbors are 
fired, the screen of the tube between them will 
be driven sufficiently negative to inhibit firing, 
the clamping diode releasing for this condition. 
Thus this stage (corresponding to a vertex) is 
never fired, and the two sides have been saved. 

The 1|1F capacitor, 4.7 K resistor, and 
diode associated with each screen prevent in- 
advertent lockout. When both flanking stages 
fire, a slight delay is introduced (t ~ 4.5 MS). 
This gives the flanked stage an opportunity to 
fire even if the request occurs late due to minor 
electrical or mechanical uncertainties. This 
delay is long enough to accommodate all such 
variations encountered, but is short compared to 
the normal rate of progression of cells through 
a vertex, thus permitting lockout to occur for 
the desired conditions . 

A decentering of the inspected figure by 
fifteen percent or so will usually cause no error. 
This lockout action is unreliable, however, when 
the figure is so positioned that two photocells 
pass through a corner at the same time . The 
thyratrons controlled by these cells will both 
fire, and the associated neons will go out 
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immediately. In practice, the number of errors 
due to this positioning is extremely low. 

Maximum- Count Detector 

Having translated the number of sides n in 
a figure into 2n lighted neons in the register 
circuit, we next convert this information into an 
electrical analog. 

The 32 neon lamps are in a light-tight 
housing, together with a photocell. Since this 
cell has a nonlinear input-output characteristic 
near the origin, a bias source of light (two 
grain- of -wheat incadescent lamps) is used to move 
the operating range into the linear region . The 
signal from this photocell is amplified by a 
chopper- stabilized operational amplifier and is 
peak detected and stored by a diode and capacitor. 
The charge on the capacitor is then proportional 
to the number of sides in the figure. 

Display Selection 

In this final step of the recognition cycle, 
the electrical analog of an n-gon is classified 
and then identified by a lighted display. 

This part of the system uses five thyratrons 
each with a relay in its plate circuit, to oper- 
ate the appropriate display. The thyratrons have 
different triggering thresholds, arranged in a 
graduated series. The signal previously stored 
on the capacitor is presented to all five control 
grids simultaneously, via a common connection, 
but the firing thresholds are reached sequentially. 

The screen grids of these tubes are 
connected to biasing networks through diodes. 

Each screen is set to a different bias in a 
stepped series. While the signal is being stored 
on the capacitor, the 5 thyratrons are inhibited, 
since their screens are held deeply negative by 
relay 2 (see sequence diagram) until the active 
scan is completed. This relay then removes the 
-33 volt supply from the screen bus, and the 
screens rise toward the different biases deter- 
mined by their networks. Capacitors from these 
screens to ground are part of RC delay networks, 
which permit the screens to reach their preset 
biases sequentially at intervals of about 5 
milliseconds . 

Now let us take an example . A signal of 
+35 volts, a typical value for a square, appears 
on the storage capacitor, and hence at the control 
grids of all 5 thyratrons. None can fire, how- 
ever, until the screen bus is released from the 
-33 volt supply which is holding all stages com- 
pletely cut off. When this inhibition is removed, 
the lower thyratron in the schematic (correspond- 
ing to a hexagon display) rises almost immediately 


to its preset bias, there being no delay built 
into this stage. With the bias selected, this 
tube vail not fire with the +35 volt signal on 
its control grid. Five milliseconds later, the 
"pentagon" thyratron reaches its preset bias, 
which is also large enough to prevent firing for 
this signal. Another five milliseconds later, 
the "square" thyratron reaches its threshold, 
which is such that signals over 30 volts will 
cause firing. Consequently it fires, its plate 
relay closes and lights up the display of a 
square. At the same time the storage capacitor 
is discharged through grid conduction in the 
fired thyratron before the remaining stages 
(triangle and circle) reach their thresholds, 
and so no other stages can fire. 

Object Count 

When discrete, opaque objects on the 
platen are scanned by the 32 photocells, thyra- 
trons controlled by intercepting cells are fired, 
while the rest are not. At the juncture of fired 
and unfired thyratrons, neons will light up, 
two for each object. Since two neons also light 
up for each side of a polygon, the processing 
of the signal is the same for both figures and 
objects, except for the display. 

If a figure is being scanned, all thyra- 
trons are fired, except the n inhibited by the 
sidesaver; if objects are being scanned, only a 
few thyratrons are fired. The difference is 
detected by a relay (6) in series with the plate 
supply for the 32 thyratrons. This relay is 
adjusted to operate when about 2k of the 32 
thyratrons are fired. In turn another relay (7) 
is operated to switch between the object display 
and the figure display. 

Relay 7 also has another function. In 
the identification of circles, should the figure 
be precisely concentric with the scan circle, 
all 32 thyratrons will fire essentially simul- 
taneously. Consequently, there will be no lamps 
turned on and no signal generated. In this case, 
relay 7 (which is energized because > 2k thyra- 
trons were fired) brings the storage capacitor 
to 0 volts (from -33V). This is sufficient to 
fire subsequently the circle thyratron. 

I acknowledge with pleasure the design 
of the mechanical pucker by C. F. Mattke and 
the coaxing of the electronics to perfection 
by S. E. Michaels. 
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N-GOKI RECOGNIZER 



FIG. 2 

Block Diagram of N-Gon Recognition System 


































FIG. 6 

Block Diagram of Recognition System as Constructed 


12.1 









362 

12.1 



FIG. 8 

N-Gon Recognizer, Showing Mechanical 
Scanner and Output Indicator 
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Schematic of N-Gon Recognizer 
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RCA's Automatic Store and Forward Message Switching System 

by 

T. L. Genetta, H. P. Guerber and A. S. Rettig 

Electronic Data Processing Division 
Industrial Electronic Products 
Radio Corporation of America 
Camden, New Jersey 


INTRODUCTION 

AutoData is an automatic, fully transistor- 
ized store and forward message switching sys- 
tem. It is designed for use in common user com- 
munication networks in which message traffic in 
digital form may be relayed throughout the world. 
When the message switching system is used, many 
advantages may be obtained, some of which are: 

• Economic utilization of communication 
facilities 

• Automatic transfer of messages through 
the network 

• Effective utilization of communication 
facilities operating at different data 
rates, with different codes, and with 
different coordination procedures 

• High degree of message protection 
all with, 

• Building block expansibility to provide 
increased service as required. 

The design features of the system are pre- 
sented by first describing the configuration of a 
communication network, then explaining the basic 
switching center functions, outlining the main sys- 
tem design considerations, and finally reviewing 
the flow of messages through a typical switching 
center. 

THE COMMUNICATION NETWORK 

Figure 1 shows the elements of a typical 
communication network using AutoData Message 
Switching Centers. Messages originate and ter- 
minate at subscriber terminal stations (Ti through 
T ) that are connected to the centers by tributary 
channels, or at Electronic Data Processing Equip- 
ment (EDPE) stations which may be local or re- 
mote from the centers. The switching centers 
are interconnected with trunk channels. All com- 
munication channels and their associated line ter- 
minal equipments are of types currently available 
in the communication industry. 



Figure 1 

The message traffic handled by the com- 
munication equipment discussed here is always 
in digital form. Each message consists of three 
parts: header, text, and ending (See Figure 2). 

The header contains all the information needed 
to route the message. The text comprises the 
intelligence to be transmitted. The ending dis- 
creetly marks the termination of the message. 

The header and ending must consist of characters 
of information organized in a manner such that 
they can be interpreted by the switching center. 

The text may consist of plain text characters, 
encrypted characters, or bit-stream traffic 
organized into checkable pseudocharacters. 

Messages are interchanged through the 
network by sending them first to the nearest 
switching center. They are automatically re- 
layed among the switching centers and finally 
distributed to the destinations listed in the headers. 
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TYPICAL MESSAGE FORMAT 
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ORIGINATING STATION ROUTING 

STATION SERIAL INDICATOR 

NUMBER END-OF-ROUTING PROSIGN 

(FOLLOWS LAST ROUTING INDICATOR ) 

MESSAGE TYPE DESIGNATOR 
PRECEDENCE PROSIGN 


Figure 2 

A message may be relayed among different types 
of communication equipment, such as teletype, 
punched card transceivers, as well as the new 
high-speed digital data transmission devices 
which will be available soon. These equipments 
have widely varying characteristics. The data 
transfer rates will vary from around 30 to 3000 
bauds. Data characters will be represented in a 
variety of codes varying from 5 to 8 elements, 
with different redundancy features. The control 
procedure used on the various channels will differ. 
Some will provide closed-loop control while others 
provide no return path for signalling; some will 
be fully automatic while others are manual. The 
modes of transmission will include several types 
of keying: For example, DC keying, tone keying, 
frequency- shift keying, and phase-shift keying 
and will be over wire line, microwave, or radio 
circuits. The AutoData equipment provides com- 
patibility between these many types of communi- 
cation devices. 

SWITCHING CENTER FUNCTIONS 

A simplified functional diagram of a Mes- 
sage Switching Center is shown in Figure 3. 



Figure 3 


Th e Input f unction receives message charac- 
ters from the various incoming channels and ac- 
cumulates them into blocks conditioned for proc- 
essing. This input function includes character 
and block checks and exercises control over the 
transfer of messages over the incoming communi- 
cation channels. These accuracy checks together 
with the ability to repeat blocks allows errors in 
transmission to be corrected automatically. 

The Message Processing function interprets 
the heading of each message and directs it to the 
appropriate outgoing channel. As the blocks of a 
message are received from the Input, they are 
transferred to the Intransit Store; only when the 
message is complete does it become a candidate 
for transmission. As each outgoing channel be- 
comes available, the Message Processing func- 
tion transfers to the Output the oldest message 
in the highest precedence category for that out- 
put channel. A stored program data processor 
is used for this function. This provides great 
logical power, a high degree of flexibility and 
very fast message handling speeds within the 
center. 

The Intransit Store function provides a re- 
servoir for messages when their outgoing channels 
are busy. It is this function that provides the 
’’store and forward” capability of the switching 
center. It permits any subscriber to send his 
message without waiting for a through connection. 
It permits one incoming message to be delivered 
to any number of addressees. It allows messages 
to be transmitted out in sequence by priority as 
well as by age. It further allows top priority 
traffic to automatically interrupt lower priority 
messages and provides the means for repeating 
the interrupted messages. 

The Output function coordinates the trans- 
fer of outbound traffic on each tributary and trunk 
channel. Storage capability is provided for each 
channel to maintain continuous transmission. 
Repeats of traffic blocks are provided by the out- 
put function as requested by the next switching 
center or tributary equipment. 

Messages to and from local EDPE and 
other High Speed Input-Output devices have ac- 
cess to the switching center by way of the Mes- 
sage Processor. Local Low Speed Input-Output 
equipment is handled like any other tributary 
channel. 

DESIGN FEATURES 

Some of the more important design features 
of the message switching system are illustrated 
in Figure 4. These features are fully automatic 











367 

12.2 


except in a few cases where the subscriber ter- channels operate at the lower baud rates . 
minal equipment does not permit. 



Figure 4 

The communication channels are shown 
terminating in an equipment complex called Tech - 
nical Control . Technical Control contains switch- 
ing facilities which provide flexibility in the con- 
nection of the various types of buffer units to the 
corresponding types of communication channels. 

In some installations this switching capability is 
manual and in others it is semiautomatic. A 
further refinement provides for patching one com- 
munication channel directly to another. This 
allows any subscriber to be connected directly 
to any other subscriber in the network; a direct 
connection of this type is convenient in cases 
where large volumes of traffic are to be trans- 
mitted to a single destination. Needless to say, 
this capability can be exercised only when both 
terminals and all the intervening communication 
channels are compatible. 

Technical control also includes test equip- 
ment for monitoring and maintaining the various 
communication channels. 

The Input function of Figure 5 is composed 
of the Buffers, Buffer Commutator, Code Con - 
verters and the Input Store. 

The Buffers provide independent termina- 
tion for the various communication channels. A 
family of buffers is available for use with the 
various types of communication terminal equip- 
ment. This permits different data rates, differ- 
erit coding and different control signalling to be 
used on the various channels. The present sys- 
tem is designed to handle data rates from 30 to 
3000 bauds on each channel with a maximum of 
50 incoming channels. The number of channels 
may be increased in groups of 25 if some of the 



Figure 5 

The Buffer Commutator causes all the buf- 
fers to be examined every 1-1/2 milliseconds. 
This speed is fast enough to require storage of 
only one character in each buffer. 

A family of Code Converters is available, 
one for each code conversion required. Each 
converter is shared by all the channels using its 
assigned code, converting characters in that code 
to characters in the common language code of the 
center. Some channels, such as trunk channels 
may use the common language code. Characters 
from these channels, of course, bypass the code 
converters. 

The Input Store consists of a magnetic core 
high-speed memory with storage areas assigned 
to the individual incoming channels. The exact 
storage location for each incoming character is 
obtained from tally information also stored in the 
high-speed memory. The tally system also pro- 
vides the basis for channel coordination. 

The philosophy of channel coordination used here 
is one of control by the receiver. As each block 
of characters is received, control characters are 
sent over the return channel either requesting the 
next block of characters or requesting a rerun of 
the same block. In this manner, no block is ac- 
cepted unless it passes all the checks provided 
on that channel. Message numbering is available 
to provide protection on those channels operating 
without feedback control. 

The Communication Data Processor (Fig- 
ure 6) incorporates a wide variety of advanced 
programming and logic features: 

• Basic Memory consists of 16, 000 56-bit 
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words accessible on al-l/2/is cycle 

• Provision is made for overlapping oper- 
ation of two memory units 

• Instructions are of variable length with 
provision for index registers for relative 
addressing, and address substitution to 
achieve direct addressing 

• Data tagging is provided for automatic 
identification of messages as well as for 
data separation 

• Simultaneous operation is provided. The 
number of routines that can take place 
simultaneously is not fixed, but is a 
function of the relative speed of any as- 
sociated devices involved and the com- 
plexity of the individual routines. Such 
simultaneous operation is particularly 
useful for file searches and general 
’'housekeeping” as well as for servicing 
on-line input/output devices. 



Figure 6 


As many of the required system operating 
features are included in the CDP as possible. 

Some of the system requirements that are pro- 
grammed are: 

• Routing of messages according to single, 
multiple, and collective routing addresses 

• Transmission by priority and age 

• Interruption for top priority traffic 

• Alarms for messages not processed out 
within a time period specified by pri- 
ority 

• Intercept on magnetic tape of selected 
traffic 

• Editing and format conversion 

• Collection and display of statistics 

• Automatic routing by both routing ad- 
dress and priority. 

These are some of the special functions provided 
by the Communication Data Processor in addition 


to its primary function of transferring traffic 
blocks from the input store to the intermediate 
store and later to the output. 

The Output function (Figure 7) comprises 
the Output Store, Code Converters , Buffer Com- 
mutator , and Output Buffers. These elements 
are the direct counterpart of the corresponding 
input elements. 

Messages to and from magnetic tapes as- 
sociated with local subscriber Electronic Data 
Processing Equipment, are accommodated through 
one of a series of Tape Station Converter Units. 
These units provide the needed signal and code 
conversion. 



Figure 7 

MESSAGE PROTECTION 

Figure 8 illustrates the message protection 
and duality features in greater detail. Journal 
and Reference Magnetic Tapes are maintained 
under control of the Communication Data Proc- 
essor. The Reference Tape contains a copy of 
every message received by the center. Each 
message is tagged to make recovery easy. The 
Journal Tape contains an abbreviated record of 
each message that has been relayed by the center. 
These records include a discreet message tag 
corresponding to the tag in the Reference Tape, 
time of processing in and out, channels and 
channel sequence number, if used. 

The Ledger Balance subsystem maintains 
a file on a magnetic drum of all messages cur- 
rently passing through the center. Each message 
is logged as it enters and as it leaves. This file 
is continuously searched for messages that have 
been in the center too long. Any overdue messages 
found are directed to the operator for expediting. 

CONTINUITY OF OPERATION 

This equipment is intended for communication 
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use and therefore requires much higher standards 
of reliability than are currently available in data 
processing equipment. Although transistor cir- 
cuitry of proven reliability is being used in the 
construction of the switching center equipment, 
an occasional failure can be expected. 

Elements of the system that are common to 
all channels are duplicated to provide back-up 
capability. One Communication Data Processor 
has enough capacity to handle all the switching 
center channels and normally carries the entire 
traffic load. A second processor is provided 
which is kept in idling status — exercising and 
performing subordinate tasks for the operating 
unit. Both processors have access to the high 
speed memories of the other, as well as all 
input/output stores, magnetic drums and mag- 
netic tapes. The operating Communication Data 


Processor continuously updates address lists in 
the idling machine so it is always ready to assume 
the traffic load. 

No automatic back up is provided for those 
elements whose failure would effect only a single 
channel or a small group of channels. The buf- 
fers fall in this category. 

Figure 9 is a photograph of a scale model 
of a 50-channel AutoData Message Switching 
System. This equipment is presently being con- 
structed by RCA for use by the military. 

This system will provide interconnection 
between existing communication networks that 
are presently considered incompatible and will 
provide communication between computers on a 
fully automatic basis. 



Figure 8 


MODEL OF AUTOMATIC STORE & FORWARD MESSAGE SWITCHING SYSTEM 



Figure 9 















PRODUCTION OF MAGAZINE LABELS 
BY THE VIDEOGRAPH PROCESS 
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By 

B. H. Klyce 
Technical Coordinator 
Subscription Service Division 
Time Incorporated 
Chicago, Illinois 


Abstract 

System considerations point out the feasibility 
of using high density master file tapes in the prepara- 
tion of subscriber labels for magazines, thus eliminat- 
ing the necessity of using the central computer complex 
for the preparation of edited tapes . The operation is 
completely automatic, and operators are required pri- 
marily to feed raw materials and file tapes into the 
machine and to remove the finished products. Produc- 
tion of labels at the rate of 131, 000 per hour is accom- 
plished by means of an electrostatic process in which 
the image is formed directly on a special paper. 

Several safety features insure reliable operation of the 
equipment. 

Introduction 

TIME Inc. maintains an active file of approxi- 
mately 10, 000, 000 subscribers to four of its domestic 
magazines - TIME, LIFE, FORTUNE and SPORTS 
ILLUSTRATED. A file of this nature is a dynamic one 
for there is constant activity in the form of changes of 
address, new subscriptions, renewals, inquiries, etc. 
Since 1946, the subscriptions have been processed with 
a semi -mechanical punched card system. Beginning 
in the fall of 1960, the punched card system will be 
"phased out" and a magnetic tape system will be install- 
ed, bringing a new degree of machine processing to 
subscription service. 

The installation of the new equipment is the 
result of nine years of intensive study of magnetic 
storage and processing techniques. As late as 1955, 
it would have cost considerably more to process sub- 
scriptions by magnetic means than by punched card 
means, but as equipment was improved, the cost kept 
decreasing until it became obvious in 1957 that a prac- 
tical magnetic system could be designed. 

Several computers were available. Feasibility 
studies were made and several proposals from com- 
puter manufacturers were evaluated with help from the 
Armour Research Foundation during the early part of 
1958. Each computer had its advantages and disadvan- 
tages but all of them had one thing in common — no sat- 
isfactory solution to the problem of printing labels . 
While 10, 000, 000 small magazine labels can be printed 


J. J. Stone 

Manager, Electronics Engineering 
and Research Department 
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Chicago, niinois 


on standard tabulating type computer output printers, 
this solution is a compromise at best and is- very ex- 
pensive. 

The label printing operation itself is an ideal 
application for special purpose equipment. The work 
load is almost constant from week to week, the form 
of the label shown in Fig. 1 is constant for long periods 
of time, and the paper on which the label is printed 
seldom changes . Therefore, rather than bend the 
system around to fit existing equipment which was de- 
signed for another purpose, separate proposals were 
solicited for new equipment designed primarily to print 
labels . 

This equipment, of course, must cooperate 
with other elements in the overall system in order to 
achieve optimum results. The basic source of infor- 
mation for the label printing job is the master file of 
subscriber data. Activity to the file in the form of new 
subscriptions, renewals, changes of address, etc., are 
accumulated on a weekly basis and the file is updated 
just before the labels are to be printed. This insures 
that the labels are as "fresh" as possible. The arrange- 
ment of the file is geo-alpha, each record being identi- 
fied by a code that is derived from the subscribers 
name and address . The code is constructed with the 
city-state portion major, followed by last name, first 
initial, street name, and house number. 

Optimum file maintenance procedures dictate 
that certain inactive subscriber records be retained in 
the file for several months, including recently expired 
subscriptions and the old address in a change of ad- 
dress activity. An analysis of the list shows that on 
the average there is about one inactive record for every 
three active records. 

On magnetic tape the records will appear as 
follows (Fig. 2): 

(a) They will be grouped in blocks of some- 
where between 10 and 20 records per block 
with 14 being the most likely choice in 
order to achieve an approximate balance 
between tape time and compute time. 
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(b) In order to conserve tape and thereby to 
reduce process time, the city-zone -state 
portion of a subscriber's name and address, 
and the code for that city, will not be 
carried in the individual record but in a 
header record that appears at the beginning 
of each new city-zone -state grouping. 

Header record and subscriber record formats 
are similar (Fig. 2b and 2c) and conform to the re- 
quirements of the IBM 7070 computer on which the file 
will be updated. Statistical information which is rela- 
tively constant is placed at the beginning of the record 
while, variable address information is placed at the end, 
allowing maximum conservation of tape. In other words, 
the record is terminated with the end of the address in- 
formation and short addresses mean short records, 
long addresses mean long records. Specifically: 

(1) The record always begins with a delta. 

(2) The next character, called the indicative 
character, identifies the record as active, 
inactive, or header, as well as a special 
record classification called Suppress Header 
which is used for subscriber records that 
contain special city-state information, there- 
by, making the header unnecessary when 
printing labels . 

(3) A second delta signals the beginning of the 
alphabetical portion of the record. 

(4) After the second delta, the first 16 digits are 
part of the code that identifies the subscriber 
record. (The next four digits are ignored.) 

(5) The name line begins with the 21st digit. 

. The first address line begins with the 41st 
digit. The second address line begins with 
the 61st digit. 

(6) The record may end with a record mark 
symbol in any multiple of five character 
positions after the twentieth position, or 
with no record mark if the record is a 
maximum length record. 

(7) The first 10 digits following the second 
delta of the header record are part of the 
code line. 

(8) The city-state line always begins with the 
11th character. 

Because of the importance of correctly identi- 
fying the header record, several checks are available 
in addition to the indicative character. 


(1) The first character following the second 
delta of the subscriber record will always 
be the initial letter of one of the months of 
the year, and the first character following 
the second delta of the header record will 
never be one of these eight characters . 

(2) The header information following the second 
delta will always be exactly 30 characters 
plus a record mark word, no more and no 
less. 

In addition to the format characteristics listed 
above, there is available a programmed signal in the 
form of a special record called an "End of Roll" record. 
This record appears to the printer as an active record 
and the information contained in it, roll identification, 
number of subscribers in the roll, etc., is printed as 
though it were a label. It's distinguishing character- 
istic is a series of five Z's following the second delta. 
This is a signal to the printer to stop printing on that 
roll and to start a new roll. As a check on the perfor- 
mance of the printer, a count is kept of the active re- 
cords going through the printer and this count is printed 
beside the computer count shown in the "End of Roll" 
record. 

This arrangement of records is advantageous 
from the file maintenance standpoint, and it also makes 
feasible the printing of labels directly from the master 
file . The manner in which the problems of printing 
control and editing were solved is described in the 
following sections . 

General Description of Label Printer System 

A. B. Dick Company undertook the task of de- 
veloping and constructing a label printer system meet- 
ing TIME Inc. 's requirements. From the outset this 
system was conceived to be a production machine ful- 
filling a vital role in the daily production of magazine 
address labels. As such, it has been designed to 
operate directly from the magnetic tape master file 
records without the necessity of using valuable computer 
time for the preparation of a pre-edited printing tape. 

It produces and delivers completed address label rolls 
ready for use by the magazine printers in their standard, 
label-affixing machines . 

The printer operates at a basic rate of 6100 char- 
acters per second. This rate increases to approximately 
7500 characters per second during the key line operation. 
These printing rates permit 36 labels per second to be 
produced and, with the automatic paper processing de- 
vices incorporated, permit the machine to produce five 
million labels per week during a standard 40 hour shift. 
Thus, two machines adequately meet the printing load 
faced by TIME Inc. with either machine capable of turn- 



ing out the entire production if operated more than a 
single shift. Each machine provides standby service 
for the other. 

To achieve this function requires two major 
units as shown in Fig. 3. The first, or lefthand one, 
of these units, 1) reads the magnetic tape records, 

2) edits the information read, 3) stores the printing 
information until used by the printer, and 4) formats 
the printing data before transmission to the printer. 

This unit is called the Tape Reader -Buffer Unit, or 
TRBU. 

The second unit, the one on the right, converts 
the digital codes received into printing signals and pro- 
duces the magazine address labels . This unit is the 
Videograph Address Label Printer. This unit operates 
continuously with provisions for automatically changing 
paper supply rolls and for cutting off completed address 
label rolls and switching to a new spindle in response 
to input control pulses . These cutoff and transfer 
operations proceed without operator intervention and 
without machine stoppage. 

The Videograph Process 

Before proceeding with a description of the 
address label printer consider briefly the printing 
process itself. Fig. 4 and 5 diagrammatic ally illus- 
trate this. 

As shown in Fig. 4, a specially constructed 
cathode ray tube produces an electrostatic image on the 
treated surface of a moving paper web. The face of 
this electrostatic printing tube (or EPT) consists of a 
matrix array of small, closely-packed wires extending 
from the inner vacuum through the glass to the outside. 
By directing the electron beam to a point on this array 
and turning it on, a current is caused to flow through 
the wires selected to the surface of the paper, produc- 
ing a charged area on the paper that is capable of later 
development. Thus, through control of the deflection 
signals, and suitable intensity modulation of the elec- 
tron beam, any desired electrostatic image may be 
produced within the boundaries of the wire matrix array. 

This printing process requires two characteris- 
tics of the paper. First, the surface must be capable 
of accepting and retaining an electric charge . This is 
accomplished by coating the surface with a thin layer 
of dielectric material. Second, the base, or body of 
the paper must be conductive to create a suitable ground 
plane near the imaged surface. This is accomplished 
by impregnating the paper with suitable materials . The 
resulting paper, therefore, does not depend upon photo - 
conductive or other unusual properties, and is, inci- 
dentally, easily and economically produced. 

Following the printing process, the latent image 
is developed as shown in Fig. 5 by dusting with a mix- 
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ture of iron powder and a black toner dust. 

The toner particles, in such a mixture, adhere 
to the negatively charged regions on the paper surface 
and are conveyed with the paper to the fixing station. 

The iron particles serve to impart a charge to the toner 
causing them to more readily adhere to the image. The 
iron becomes, through the charge interchange, opposite- 
ly charged and is repelled by the imaged areas . The 
iron, therefore, serves only as a carrier for the toner 
and is not used up in the process of development. 

Following development, application of heat to the 
developed image causes the thermoplastic toner parti- 
cles to fuse and attach themselves to the paper surface. 
The image is thus made permanent and will not smudge 
as a result of later handling. 

Character Generation 

In order to use the electrostatic printing tube for 
the printing of alphanumeric characters, some means 
must be provided for the conversion of digitally coded 
information into video-type signals. Many approaches 
were possible due to the flexibility of the printer tube. 

The means selected utilizes a monoscope tube 
such as shown in Fig. 6. This tube is basically a 
cathode ray tube with an electrostatic deflection system. 
At the screen end of the tube a circular aluminum disc 
replaces the phospher as the target for the electron 
beam. Immediately in front of the target is a somewhat 
more positive electrode for the collection of the secon- 
dary electrons emitted from the target. 

Operation of the monoscope depends upon the 
secondary emission characteristics of the target sur- 
face. When the beam strikes the aluminum surface of 
the target, a secondary emission ratio of approximate- 
ly two results . This causes an electron current to flow 
from the external circuitry into the target. This con- 
dition would be reversed should the surface be covered 
with a carbon-based ink having a secondary emission 
ratio of somewhat less than one. 

The target for the monoscope used contains a 
square array of 64 characters on its front surface. 

Each character is etched into the aluminum surface. 

The remaining surface is covered with a carbon-based 
ink to form the background. As the primary beam 
scans a single character in a raster-like manner, the 
resultant current into the external circuit represents 
a video signal describing the character. By scanning 
the printing tube with an identical raster and intensity 
modulating the beam with the video thus produced, an 
electrostatic image of the character on the monoscope 
may be created on the videograph paper. 

Character selection signals originate in a highly 
stable circuit generating one out of eight voltage levels 
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for both the horizontal and the vertical deflection plates . 
An input, six-bit digital code, representing the desired 
character, controls the selection circuits in such a 
manner that the first three bits of this code select the 
vertical position, and the second three bits select the 
horizontal position of the character to be scanned. By 
this means, the positioning of the characters on the 
monoscope target decodes the input and eliminates the 
need for further logical circuitry. 

Since no cutouts or stencils of the letters are 
involved, and since the plates may be easily produced 
from original artwork by photo-etching techniques, al- 
most any type font may be chosen. In fact the type font 
may be varied by switching tubes to meet special print- 
ing needs in the future. 

Electronic System of Printer 

Using the printing process and the character 
generator described above, the electronic system of 
the Videograph Label Printer produces an electrostatic 
image of the labels on a continuously moving web of 
paper. The operation of this system is shown in the 
block diagram of Fig. 7. 

Digital signals from the TRBU trigger the char- 
acter generator once for each input character. In this 
unit the selection circuits select the character to be 
printed and the raster-scan circuit causes this char- 
acter to be scanned. The same raster signals are 
transmitted to the vertical deflection coil of the printer 
tube and through the horizontal deflection circuitry to 
the horizontal coil. In addition, an unblanking signal 
and the resultant video signal modulate the beam of the 
printing tube so as to produce the latent image of the 
scanned character. 

The raster signals used for the character scan 
consist of a 200 kc vertical sine wave that is continu- 
ously applied to both the monoscope and the printer tube. 
During the generation of a character, a 100 micro- 
second horizontal sawtooth signal produces the slower 
horizontal scan in the monoscope and controls the hori- 
zontal movement of the beam in the printer tube. The 
resultant horizontal signal for the printer tube, there- 
fore, consists of a stepwise signal combined with a 
sawtooth. The stepwise component provides the char- 
acter by character separation, and the sawtooth con- 
trols the writing of individual characters. The hori- 
zontal circuitary is reset for the beginning of each line 
by the pulse supplied from the TRBU. 

The First Line Pulse modifies the horizontal 
deflection circuitry so as to compress the key line as 
shown in Fig. 1. This compression occurs as a result 
of reducing the magnitudes of the horizontal sawtooth 
and step signals in the horizontal deflection circuitary. 
The compression circuits are reset by the next 
"Beginning of Line" pulse from the TRBU. 


The motion of the paper controls the printing of 
labels so that each label will be accurately centered 
between holes. The paper control circuitry, therefore, 
produces a label demand pulse and transmits this pulse 
to the TRBU to initiate the printing of each new label. 

At the end of each label roll, as determined by infor- 
mation from the master file tapes, an "End of Roll" 
pulse is sent to the printer from the TRBU. This 
causes an interruption in the transmission of the label 
demand pulses for 2. 5 seconds in order to create a 
blank strip of paper between label rolls . At an appro- 
priately later time the web is cut and caused to start 
rewinding on a separate spindle. 

Paper Processing System 

The mechanical paper processing system consists 
of three basic modules. These are shown in Fig. 8. 
From left to right these are 1) the Unwind module, 

2) the processing module and 3) the rewind module. 

The unwind module contains two supply rolls of 
paper, a flying splice mechanism and a supply elevator. 
The upper supply roll normally feeds paper into the 
printer while the lower roll is in standby. When the 
upper roll is depleted, as detected by a feeler arm and 
cam, the splicer stops the flow of paper. During the 
splice operation, the supply elevator supplies paper to 
the printer so that actual printing does not stop. After 
the paper motion stops at the splicer, the old web is 
cut off and the leading edge of the new web attached to 
the trailing edge of the old. The web is then released 
and the new roll commences to provide the system. The 
entire unwind mechanism then revolves, bringing the 
new roll to the top and placing the empty spindle on the 
bottom for loading of a new roll of paper for standby. 

In the central, or processing, module, the paper 
passes in front of the EPT where the electrostatic 
image is created in its left surface. It then proceeds 
through the developer and around the fuser unit. From 
this point it proceeds to the rewind module . The opera- 
tions of the printer tube have already been described. 
The developer unit conveys the iron powder -toner mix 
to the paper surface to develop the image . The fuser 
uses rf to create a corona discharge to the paper sur- 
face causing a very rapid surface heating for the fusing 
operation. 

In the rewind module, there is a primary paper 
drive -punch unit, a rewind elevator and a six spindle 
rewind assembly. The primary drive -punch unit uses 
hardened dies in the drive drum meshing with soft steel 
punches on an adjacent drum. These elements of a 
rotary punch are on one inch centers and create the 
holes in the label strips as shown in Fig. 1. In addi- 
tion, this unit generates pulses in synchronization with 
the hole punching operation to initiate the printing of 
each label. Since the distance from this drive unit back 
to the printer tube remains fixed, this exactly locks the 



position of the printed labels with respect to the holes 
punched. 

The paper web normally travels over the left 
most spindle of the rewind station and is wound on the 
lower left spindle as shown. At the end of a roll, a 
cutting knife opposite the left most spindle cuts the web 
and a roller, together with a vacuum in the spindle, 
causes the rewinding to commence on the left most 
spindle. The mechanism then rotates 60* counter- 
clockwise, returning the system to its original state. 
The completed roll is conveyed to the lower right posi- 
tion where it can be removed by the operator and a new 
core placed on the spindle . 

Operation of the Tape -Reader -Buffer Unit 

From the preceding it may be seen that the 
printer operation proceeds in a most conventional 
manner but at a very high rate of speed. It could just 
as easily be recording data directly from the computer 
as an "on line" device or could be printing the text of 
the congressional record. However, the TRBU converts 
this conventional printer to a TIME Inc. printer system. 
This unit contains the editing, logic, and buffer circuits 
necessary to permit this printer to be driven directly 
from the master file records produced by the computer. 

With the tape organization described previously, 
the TRBU's input control circuitry can, as shown in the 
block diagram of F ig. 9, edit the incoming information 
from the tape . Information read a block at a time from 
tape 1 in the figure, passes sequentially through SI and 
the two character storage to S2. While in this limited 
storage, the characters are examined by the input logic 
control circuits in order to check the parity of the char- 
acters as read and to control the setting of the switch, 
S2, at the input to the main buffer storage unit. 

Switch S2 normally is in the lower position so 
that no information enters the buffer. When the first 
delta symbol is received from the tape, the input logic 
circuit examines the numerical digit immediately fol- 
lowing it. If the record is a non -printing subscriber 
record, the control circuitry leaves S2 in the open 
position throughout the remainder of that record. If 
the record is that of an active subscriber, or a header, 
S2 connects with the two character storage input at the 
second delta symbol in the record and enters into stor- 
age the second delta symbol together with the portion 
of the record following it. S2 immediately returns to 
the open position at the instant the next delta symbol, 
signifying the start of the next record, is detected by 
the input logic circuitry. 

A parity error detected by the input logic cir- 
cuitry is counted and added to the total count of such 
errors stored in the 144 character buffer unit. In 
addition, S2 and the special symbol generator inserts 
the code for a question mark for the character misread. 
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Normally the detection of a parity error does not stop 
the machine since a few such errors will still permit 
delivery of the magazine. A label is a very redundant 
item, especially when coupled with the postman's 
memory. Such an error will stop the machine, how- 
ever, when its frequency exceeds a present amount, or 
when it occurs at a critical point where confusion is 
likely in the handling of the header. 

The 2184 character main storage buffer has a 
minimum capacity of approximately 26 combinations of 
header and subscriber records . When the number of 
records in storage drops below approximately 12, the 
input tape drive control causes the tape unit in use to 
read the next block. By this means the main buffer is 
kept loaded and the tape reading rate is under the con- 
trol of the printer and its operations . At the end of the 
first tape, SI automatically switches to Tape 2 and 
operation continues as before while the operator reloads 
Tape 1 with a fresh tape. 

The time required to read a block of information 
from the input tape is approximately 50 milliseconds . 

The time to empty the main buffer by printing opera- 
tions is approximately 300 milliseconds. Thus, 
approximately 6 blocks of information could be read 
from the input tape before the buffer would be emptied 
by the printer. As a result, there could be a series of 
up to 84 consecutive inactive records without causing 
the printer to print a blank label because of insufficent 
input information. However, no useful information is 
lost as a result. Operational studies of the master 
files at TIME Inc. indicate that such a series of inactive 
records will almost never occur. 

At the output of the 2184 character buffer, S3 and 
S4, under the control of the output logic circuitry, de- 
termine the sequence of characters coupled to the 
printer. For the printing of a subscriber address, the 
data denoting the specific subscriber constitutes the 
subscriber record and is obtained for the printing oper- 
ations directly from the main buffer. For this the 
switch S3 is positioned in its center position. The last 
seven characters of the key line and the last address 
line are associated with the city -zone -state and are 
part of the header record as stored in the 144 character 
buffer in the block diagram. For the printing of these 
characters, S3 and S4 are in their lower-most positions. 
This causes the output of the smaller buffer to be de- 
livered to the printer and at the same time to circulate 
back into the input of the buffer so that the same header 
information will be available for the printing of the next 
label. The two spaces and the "Beginning of Town" 
marks are supplied from the special symbol generator 
with S3 in its uppermost position. The "Beginning of 
Town" marks are only placed on the first label in each 
new city -zone -state series. 

The delivery of data to the printer commences 
with the receipt of a "Label Demand” pulse from the 
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printer and continues at the character writing rate of 
6100 characters per second until the label is completed. 
Completion of a label is normally determined in one of 
three possible ways. These are: 

(1) detection of a delta symbol out of the main 
buffer. 

(2) detection of a record mark out of the main 
b u ffer . 

(3) completion of a full label character count 
by counters in the output logical circuitry. 

The occurrence of any one of these causes the 
output to switch to the 144 character circulating buffer 
for the printing of the city-zone -state line. 

In the event that other than a delta symbol term- 
inates the printing, the main buffer advances to the next 
delta symbol. The digit immediately following this 
delta is then examined by the output logic circuitry to 
determine the type of this record. If this record is a 
header record, S4 switches to its upper position and 
the new header is rapidly transferred to the circulating 
buffer to replace the old. If this record is a subscriber 
record, the operations stop until receipt of the next 
"Label Demand" pulse from the printer. If this record 
is an "End-of-Roll" summary record, the output logic 
circuitry causes the next record, a summary for the 
label roll just completed, to be printed upon receipt of 
the next "Label Demand" pulse and also: 

(1) causes the error and label count registers 
in the 144 character buffer to be printed 
with the summary record, and 

(2) sends an "End-of-Roll" pulse to the printer 
following the printing of the summary 
record. 

The other signals shown in Fig. 9 are for the 
purpose of controlling the printer’s operation in the 
manner previously described. It may be mentioned at 
this point that the clock controlling the printing rate is 
part of the output logic circuitry of the TRBU and for 
this reason the printer is under control of the TRBU 
once printing commences . 

Summary 

Little has been said of the built-in checks in 
both the TRBU and in the printer to assure the operator 
that the entire system has and is working properly. 
Space does not permit a discussion of these features. 
Suffice it to mention that they do exist and will sound 
alarms or stop the machine depending upon the severity 
of the malfunction. This system differs, however, 
from many other systems in regard to the treatment of 
errors. It is essential to the welfare of the magazine 


that each subscriber obtain his regular copy. It is not 
necessarily essential that every label be perfect to 
obtain this result. Thus, certain errors are permitted 
in the operation of the system and a count is maintained 
of the number of such occurrences . The system stops 
only if the number of such events becomes too numer- 
ous, or until the operators feel that the system should 
be serviced. 

The principle advantages of the Videograph print- 
ing and character generation processes are their speed 
and flexibility. Neither the printing tube nor the char- 
acter generator operate near their limits in producing 
the 36 labels per second. Each could function at twice 
the present rate although little would be gained in TIME 
Inc. 's application by doing so. Their label printing 
requirements can be met, together with anticipated 
growth, with two machines operating on a single shift 
and a portion of a second shift. 

The TRBU makes possible the operation of the 
label printer direct from the information contained in 
the master file records . This eliminates the need for 
using computer time in the preparation of edited tapes 
for printing purposes. This saving is an operating cost 
saving and its magnitude more than offsets the addition- 
al cost of the TRBU . 

The flexibility of the system and the process 
makes possible the compression of the key line, quick 
changes in type font or input digital coding, easy rever- 
sal of the printed image for heat transfer applications, 
and a minimum of circuitry capable of rapid adaptation 
to future changes and applications . Thus, the system 
developed amply meets the requirements set forth by 
TIME Inc. and, in addition, provides them with a 
system capable of expansion and modification to meet 
their future needs . 

Acknowledgements 

As with many modern developments, the Video - 
graph process and, specifically, the Videograph Label 
Printer System described in this paper, has been the 
result of a cooperative effort on the part of many com- 
panies and individuals . In particular, mention should 
be made of the contributions of Stanford Research 
Institute in the development of the process and many of 
the circuits under the sponsorship of A. B. Dick Co. 
and Telemeter Magnetics, Inc. for the design and con- 
struction of the TRBU. 



377 

12.3 


O 

MAR 59 ROT 99999 C98L87 22 20 
MRS. CHARLES ROOT 
HERRIN ILL 


o 

SEP 58 MTC RN457W98L87 01 1 1 
WILLIAM S MITCHELL 
2457 RONALD STREET 
CHICAGO, ILL 

o 

NOV 62 GAN LK850J98L87 0111 
MR. JOHN T GRANT 
850 LAKE SHORE DRIVE 
LAKE SHORE CLUB 
CHICAGO 1 1. ILL 

o 

MAY 60 SAA 0I303B98L87 01 I I 
BILL SACAR 
303 OHIO ST 
CHICAGO II.ILL 

_ O 



Figure 1 

FORMAT OF MAGAZINE LABELS 
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Figure 2 

MAGNETIC TAPE & RECORD ORGANIZATION 
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Figure 6 

VTDEOGRAPH CHARACTER GENERATOR TUBE 



Figure 7 

VIDEOGRAPH LABEL PRINTER-ELECTRONIC BLOCK DIAGRAM 
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Figure 9 

TAPE READER & BUFFER UNIT-BLOCK DIAGRAM 















